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BACTERICIDAL/PERMEABILITY-INCREASING PROTEIN: 
CRYSTALLIZATION, X-RAY DIFFRACTION, THREE- 
DIMENSIONAL STRUCTURE DETERMINATION, 
RATIONAL DRUG DESIGN AND MOLECULAR MODELING OF 

RELATED PROTEINS 



The present invention generally pertains to the fields of protein crystallization, 
x-ray diffraction analysis, three-dimensional structural determination, rational drug 

10 design and molecular modeling of related proteins. The present invention solves the 
three-dimensional structure of bactericidal/permeability-increasing protein (BPI) and 
provides crystallization methods for BPI protein products. A crystallized BPI protein 
product was physically analyzed by x-ray diffraction techniques. The resulting x-ray 
diffraction patterns were of sufficiently high resolution to be useful for determining 

15 the three-dimensional structure of BPI and have yielded atomic coordinates for BPI. 
The present invention relates to uses of BPI coordinates for molecular modeling of 
related proteins and rational drug design (RDD) of mimetics and ligands for BPI and 
for related proteins. The present invention also relates to atomic coordinates of BPI, 
or portions thereof, to solve crystal forms of BPI proteins or their fragments, analogs, 

20 and variants thereof, or of related proteins, including lipid transfer proteins, or their 
fragments, analogs and variants. 

Part of the work performed during development of this invention utilized U.S. 
Government funds. The U.S. Government has certain rights in this invention. 



the granules of mammalian polymorphonuclear leukocytes (PMNs or neutrophils), 
which are blood cells essential in the defense against invading microorganisms. BPI 
is known to bind the lipopoly saccharide (LPS), a major component of the outer 
membrane of gram-negative bacteria that stimulates a potent inflanmiatory response. 
30 Human BPI protein has been isolated from PMNs by acid extraction combined with 
either ion exchange chromatography [Elsbach, J. Biol. Chem,. 254:11000 (1979)] or 
£. coli affinity chromatography [Weiss, et al.. Blood. 69:652 (1987)]. BPI obtained 



5 




Background of the Invention 

Bactericidal/permeability-increasing protein (BPI) is a protein isolated from 
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in such a manner is referred to herein as natural BPI and has been shown to have 
potent bactericidal activity against a broad spectrum of gram-negative bacteria. The 
molecular weight of human BPI is approximately 55,000 daltons (55 kD). The amino 
acid sequence of the entire human BPI protein and the nucleic acid sequence of DNA 

5 encoding the protein have been reported in Figure 1 of Gray et al., J, Biol. Chem,, 
264:9505 (1989), incorporated herein by reference. The Gray et al. amino acid 
sequence is set out in SEQ ID NO: 1 hereto. U.S. Patent No. 5,198,541, EP0375724 
and W089/ 10486 (PCT/US88/02700) disclose recombinant genes encoding and 
methods for expression of BPI proteins, including BPI holoprotein and fragments of 

10 BPI. 

A proteolytic N-terminal fragment of BPI having a molecular weight of about 
25 kD possesses essentially all the anti-bacterial efficacy of the naturally-derived 55 
kD human BPI holoprotein. [Ooi ei al., J. Bio. Chem., 262: 14891-14894 (1987)]. 
In contrast to the N-tenninal portion, the C-terminal region of the isolated human BPI 

15 protein displays only slightly detectable anti-bacterial activity against gram-negative 
organisms. [Ooi et al., 7. Exp. Med,, 174:649 (1991).] An N-terminal BPI fragment 
of approximately 23 kD, referred to as "rBPIjj," has been produced by recombinant 
means and also retains anti-bacterial activity against gram-negative organisms. 
[Gazzano-Santoroetai., Infect. Immun, 60:4754-4761 (1992).] An N-terminal analog 

20 of BPI, rBPIii, has been produced as described in Horwitz et al.. Protein Expression 
Purification, 8:28-40 (1996). 

The bactericidal effect of BPI has been reported to be highly specific to gram- 
negative species, e.g., in Elsbach and Weiss, Inflammation: Basic Principles and 
Clinical Correlates, eds. Gallin et al.. Chapter 30, Raven Press, Ltd. (1992). This 

25 reported target cell specificity was believed to be the result of the strong attraction 
of BPI for LPS on the outer membrane (or envelope) of gram-negative organisms. 
Although BPI was commonly thought to be non-toxic for other microorganisms, 
including yeast and for higher eukaryotic cells, it has recently been discovered that 
BPI protein products exhibit activity against gram-positive bacteria, mycoplasma, 

30 mycobacteria, fungi, protozoa and chlamydia. 

The precise mechanism by which BPI kills gram-negative bacteria is not yet 
completely elucidated, but it is believed that BPI must first bind to the surface of the 



wo 98/58961 



m 

PCT/US98/13007 



3 

bacteria through electrostatic and hydrophobic interactions between the caiionic BPI 
protein and negatively charged sites on lipopoly saccharide. Bacterial LPS has been 
referred to as "endotoxin" because of the potent inflammatory response that it 
stimulates, i.e., the release of mediators by host inflanunatory cells which may 
5 ultimately result in irreversible endotoxic shock. BPI binds to lipid A, reponed to 
be the most toxic and most biologically active component of LPS. 

In susceptible gram-negative bacteria, BPI binding is thought to disrupt LPS 
structure, leading to activation of bacterial enzymes that degrade phospholipids and 
peptidoglycans, altering the permeability of the cell's outer membrane, and initiating 

10 events that ultimately lead to cell death. [Elsbach and Weiss (1992), Jt/pra]. BPI has 
been proposed to act in two stages. The first stage proposed is a sublethal one that 
is characterized by inunediate growth arrest, permeabilization of the outer membrane 
and selective activation of bacterial enzymes that hydrolyze phospholipids and 
peptidoglycans. Bacteria at this stage could be rescued by growth in serum albumin 

15 supplemented media [Mannion et al., 7. Clin, Invest., 55:853-860 (1990)]. The 
second stage, defined by growth inhibition that cannot be reversed by serum albumin, 
is proposed to occur after prolonged exposure of the bacteria to BPI and characterized 
by extensive physiologic and structural changes, including apparent damage to the 
inner cytoplasmic membrane. 

20 Initial binding of BPI to LPS leads to organizational changes that probably 

result from binding to the anionic groups of LPS, which normally stabilize the outer 
membrane through binding of Mg** and Ca"^*. Attachment of BPI to the outer 
membrane of gram-negative bacteria produces rapid permeabilization of the outer 
membrane to hydrophobic agents such as actinomycin D. Binding of BPI and 

25 subsequent gram-negative bacterial killing depends, at least in part, upon the LPS 
polysaccharide chain length, with long O-chain bearing, "smooth" organisms being 
more resistant to BPI bactericidal effects than short O-chain bearing, "rough" 
organisms [Weiss et al., 7. Clin. Invest. 65: 619-628 (1980)]. This permeabilization 
of the gram-negative outer envelope is reversible upon dissociation of the BPI, a 

30 process requiring high concentrations of divalent cations and synthesis of new LPS 
[Weiss et al., 7. Immunol. 132: 3109-3115 (1984)]. Loss of gram-negative bacterial 
viability, however, is not reversed by processes which restore the envelope integrity. 
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suggesting that the bactericidal action is mediated by additional lesions induced in the 
target organism and which may be siniated at the cytoplasmic membrane (Mannion 
et al., y. Clin. Invest. 86: 631-641 (1990)). Specific investigation of this possibility 
has shown that on a molar basis BP! is at least as inhibitory of cytoplasmic membrane 
5 vesicle function as polymyxin B but the exact mechanism as well as the relevance of 
such vesicles to studies of intact organisms was not elucidated (In't Veld, et al., 
Infeaion and Immunity 56: 1203-1208 (1988)). 

BPI is a member of a gene/protein family of lipopolysaccharide binding and 
lipid transfer proteins whose other currently known members include 
10 lipopolysaccharide binding protein (LBP), cholesteryl ester transfer protein (CETP) 
and phospholipid transfer protein (PLTP). 

BPI protein products (which include naturally and recombinantly produced BPI 
protein; natural, synthetic, and recombinant biologically active polypeptide fragments 
of BPI protein; biologically active polypeptide variants of BPI protein or fragments 
15 thereof, including hybrid fusion proteins and dimers; biologically active polypeptide 
analogs of BPI protein or fragments or variants thereof, including cysteine-substituted 
analogs; and BPI-derived peptides) have been demonstrated to have a variety of 
beneficial activities. BPI protein products are known to be bactericidal for gram- 
negative bacteria, as described in U.S. Patent Nos. 5,198,541 and 5,523,288, both 
20 of which are incorporated herein by reference. BPI protein products are also known 
to enhance the effectiveness of antibiotic therapy in gram-negative bacterial infections, 
as described in U.S. Patent No. 5,523,288 and corresponding International 
Publication No. WO 95/08344 (PCT/US94/ 11225), which are incorporated herein by 
reference. BPI protein products are also known to be bactericidal for gram-positive 
25 bacteria and mycoplasma, and to enhance the effectiveness of antibiotics in gram- 
positive bacterial infections, as described in U.S. Patent No, 5,578,572 and 
corresponding International Publication No. WO 95/19180 (PCT/US95/00656), which 
are incorporated herein by reference. BPI protein products are further known to 
exhibit anti-fungal activity, and to enhance the activity of other anti-fungal agents, as 
30 described in U.S. Patent No. 5,627,153 and corresponding International Publication 
No. WO 95/19179 (PCT/US95/00498), and further as described for anti-fungal 
peptides in co-pending U.S. Application Serial No. 08/621,259 filed March 21, 1996, 
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which is in mm a continuation-in-part of U.S. Application Serial No. 08/504,841 
filed July 20, 1994 and corresponding International Publication Nos. WO 96/08509 
(PCT/US95/09262) and WO 97/04008 (PCT/US96/03845), all of which are 
incorporated herein by reference. BPI protein products are further known to exhibit 

5 anti-protozoan activity, as described in U.S. Patent No. 5,646, 1 14 and corresponding 
International Publication No. WO 96/01647 (PCT/US95/08624). all of which are 
incorporated herein by reference. BPI protein products are known to exhibit anti- 
chlamydial activity, as described in co-pending U.S. Application Serial No. 
08/694,843 filed August 9, 1996 and corresponding International Publication No. WO 

10 98/06415 (PCT/US97/13810), all of which are incorporated herein by reference. 
Finally, BPI protein products are known to exhibit anti-mycobacterial activity, as 
described in co-pending U.S. Application Serial No. 08/626,646 filed April 1, 1996, 
which is in turn a continuation of U.S. Application Serial No. 08/285,803 filed 
August 14, 1994, which is in turn a continuation-in-part of U.S. Application Serial 

15 No. 08/031.145 filed March 12, 1993 and corresponding International Publication 
No. WO94/20129 (PCT/US94/02463), all of which are incorporated herein by 
reference. 

The effects of BPI protein products in humans with endotoxin in circulation, 
including effects on TNF, IL-6 and endotoxin are described in U.S. Patent Nos. 

20 5,643,875 and 5,753,620 and corresponding International Publication No. WO 
95/19784 (PCT/US95/01151), all of which are incorporated herein by reference. 

BPI protein products are also known to be useful for treatment of specific 
disease conditions, such as meningococcemia in humans (as described in co-pending 
U.S. Application Serial No. 08/644,287 filed May 10, 1996 and corresponding 

25 International Publication No. WO 97/42966 (PCT/US97/08016), which are 
incorporated herein by reference), hemorrhagic trauma in humans, (as described in 
co-pending U.S. Application Serial No. 08/862,785, a continuation-in-part of U.S. 
Serial No. 08/652,292 filed May 23, 1996, now U.S. Patent No. 5,756,464, and 
corresponding International Publication No. WO 97/44056 (PCT/US97/08941), all 

30 of which are incorporated herein by reference), bum injury (as described in U.S. 
Patent No. 5,494,896 and corresponding International Publication No. WO 96/30037 
(PCT/US96/02349), both of which are incorporated herein by reference). 
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ischemia/reperfusion injury (as described in U.S. Patent No. 5,578,568, incorporated 
herein by reference), and liver resection (as described in co-pending U.S. Application 
Serial No. 08/582,230 filed March 16. 1998 which is a continued prosecution 
application of the same serial no. filed January 3, 1996, which is in turn a 

5 continuation of U.S. Application Serial No. 08/318,357 filed October 5. 1994, which 
is in turn a continuation- in-pan of U.S. Application Serial No, 08/132,510 filed 
October 5, 1993, and corresponding International Publication No. WO 95/10297 
(PCT/US94/ 11404), all of which are incorporated herein by reference). 

BPI protein products are also known to neutralize the anti-coagulant activity 

10 of exogenous heparin, as described in U.S. Patent No. 5,348,942, incorporated herein 
by reference, as well as to be useful for treating chronic inflammatory diseases such 
as rheumatoid and reactive arthritis, as described in U.S. Patent No. 5,639,727, 
incorporated herein by reference, and for inhibiting angiogenesis and for treating 
angiogenesis-associated disorders including malignant tumors, ocular retinopathy and 

15 endometriosis, as described in co-pending U.S. Application Serial Nos. 08/435,855, 
08/466,624 and 08/466,826, and corresponding International Publication No. WO 
94/20128 (PCT/US94/02401), all of which are incorporated herein by reference. 

BPI protein products are also known for use in antithrombotic methods, as 
described in U.S. Patent No. 5,741,779 and corresponding International Publication 

20 No. W097/42967 (PCT/US97/08017), which are incorporated herein by reference. 

U.S. Patent Nos. 5,420,019 and 5,674,834 and corresponding International 
Publication No. W094/18323 (PCT/US94/01235), all of which are incorporated 
herein by reference, disclose that the replacement of the cysteine residue at amino 
acid position 132 or 135 with another amino acid renders the resulting BPI 

25 polypeptide resistant to dimerization and cysteine adduct formation. It also discloses 
that terminating the N-terminal BPI fragment at BPI amino acid position 193 resulted 
in an expression product with reduced carboxy-terminal heterogeneity. 

Because of the multiplicity of valuable activities and uses of BPI protein 
products as exemplified above, a need continues to exist for new products with 

30 strucmres based on or mimicking a BPI protein product and having one or more of 
the activities and/or uses of BPI protein products, including use as anti-infective 
products, including antimicrobial agents (e.g., gram-negative bacteria [U.S. Patent 
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Nos. 5,198,541 and 5,523,288; WO95/08344 (PCT/US94/ 11225)] and gram-positive 
bacteria [U.S. Patent No. 5,578,572; WO95/19180 (PCT/US95/00656)], fungi [U.S. 
Patent No. 5,627,153; WO95/19179(PCT/US95/00498)], mycobacteria [EP0690721; 
WO94/20129 (PCT/US94/02463)] and chlamydia [WO96/01647 (PCT/US95/08624)] 
5 and endotoxin binding/neutralizing agents [WO95/019784 (PCT/US95/01151)], and 
as heparin binding/neutralizing products [U.S. Patent Nos. 5,348.942 and 5,639,727; 
WO94/20128 (PCT/US94/02401)]. including for the neutralization of exogenously 
administered heparin, inhibition of angiogenesis (normal or pathological) for the 
treatment of chronic inflammatory disease states, and anticoagulant and thrombolytic 
10 agents for the treatment of thrombotic disorders [PCT/US97/08017]. All of the 
above-listed references regarding biological or functional activities of BPI, as well as 
^ therapeutic and diagnostic uses of BPI, are hereby incorporated by reference. One 

=P avenue of investigation toward solving the problem of new products based on BPI and 

J fulfilling this need is the determination of the crystal structure of a BPI protein 

4= 15 product. 

Ui Summary of the Invention 

P The present invention solves the above problem and fulfills the need for 

H designating and making new and useful products based on BPI. It is an object of this 

invention to solve the three-dimensional structure of BPI and thereby provide the 

D 20 atomic coordinates (i.e., structure coordinates) of BPI from the analysis of x-ray 

o 

diffraction patterns of sufficiently high resolution to be useful for determining the 
three-dimensional protein strucmre. 

It is an object of this invention to provide methods of expressing, purifying 
and crystallizing bactericidal/permeability-increasing protein (BPI) products, and 
25 thereby provide crystallized BPI proteins. 

It is an object of this invention to provide the use of the structure coordinates 
of a BPI crystal to allow the design of compounds for mimicking a BPI protein 
product to reveal the atomic details of ligand binding sites of BPI ie,g, lipid-like or 
heparin-like molecules). 
30 It is an object of this invention to provide use of the structure coordinates of 

a BPI crystal as described herein to solve the crystal structure of a crystal of a 
different BPI protein or fragment, analog or variant thereof, or a crystal of a related 
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protein, including a BPI-related lipid transfer protein or a fragment, analog or variant 
thereof. 

It is an object of this invention to provide mutants of BPI or fragments, 
analogs, or variants thereof characterized by one or more different propenies as 
compared with wild-type BPI. These propenies include altered surface charge, 
altered lipid binding pockets, altered specificity or higher activity. BPI mutants are 
useful to identify those amino acids that are most important for the lipid and heparin 
binding activity and other biological activities of BPI. This information, in turn, 
allows the design of new structures with one or more different properties based on 
BPI. 

It is an object of this invention to provide the use of the structure coordinates 
and atomic details of BPI as described herein or its fragments, analogs or variants 
(including mutants or co-complexes) or of a BPI-related lipid transfer protein or its 
fragments, analogs or variants (including mutants or co-complexes) to design, 
evaluate computationally, synthesize and use new structures based on BPI with 
desirable properties of BPI, such as physical and pharmacological properties. 

X-ray diffraction patterns of a related protein can be analyzed directly to 
provide the three-dimensional structure (if of sufficiently high resolution), however, 
the atomic coordinates for the crystallized BPI, as provided herein, can be used for 
strucmre determination. The x-ray diffraction patterns obtained by methods of the 
present invention, and provided on computer readable media, are used to provide 
electron density maps. The amino acid sequence is also useful for three-dimensional 
structure determination. The data is then used in combination with phase 
determination (e.g. using multiple isomorphous replacement (MIR) molecular 
replacement techniques) to generate electron density maps of BPI, using a suitable 
computer system. The electron density maps, provided by analysis of either the x-ray 
diffraction patterns or working backwards from the atomic coordinates, provided 
herein, are then fitted using suitable computer algorithms to generate secondary, 
tertiary and/or quaternary structures and/or domains of BPI, which structures and/or 
domains are then used to provide an overall three-dimensional structure, as well as 
binding and/or active sites of BPI. 
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It is also an object of this invetition to specifically provide for the use of three- 
dimensional modeling of BPI and other members of the BPI protein family using the 
coordinates from the x-ray diffraction patterns. The coordinates and amino acid 
sequences are entered into one or more computer programs for molecular modeling. 
5 Such molecular modeling programs generate atomic coordinates that reflect the 
secondary, teniary and/or quaternary strucmres of the protein which contribute to its 
overall three-dimensional structure and provide information related to binding and/or 
active sites of the protein. 

It is a further object of this invention to specifically provide for the use of 

10 similar molecular modeling for rational drug design (RDD) of mimetics and ligands 
of BPI and other members of the BPI protein family. The drug design paradigm uses 
computer modeling programs to determine potential mimetics and ligands which are 
expected to interact with sites on the protein. The potential mimetics or ligands are 
then screened for activity and/or binding. For BPI-related mimetics or ligands, 

15 screening methods can be selected from assays for at least one biological activity of 
BPI, e.g., anti-microbial, LPS- binding/neutralizing, heparin binding/neutralizing, 
and/or anti-thrombotic activities, according to known method steps. Similarly for 
LBP-, CETP- or PLTP-related mimetics or ligands, such screening methods can be 
selected from assays for at least one biological activity of LBP, CETP or PLTP, 

20 according to known method steps. The resulting mimetics or ligands are then 
provided by methods of the present invention and are useful for treating, inhibiting 
or preventing BPI-modulated diseases (or LBP-, CETP- and PLTP-modulated) in 
animals, including humans. 

Thus, as described herein, the present invention provides use of atomic 

25 coordinates of a BPI protein, or fragment, analog or variant thereof, to model a BPI 
protein or a related protein, including a BPI-related lipid transfer protein, such as 
LBP, CETP or PLTP, or fragment, analog or variant thereof. 

The present invention also provides use of atomic coordinates of a BPI protein 
wherein the BPI protein comprises a binding site characterized by amino acid residues 

30 of at least one binding pocket as defined in Table 3, and/or wherein the BPI protein 
comprises a binding site characterized by at least one amino acid sequence, or variant 
of the sequence, selected from positions about 17 to about 45, positions about 65 to 
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about 99 or positions about 142 to about 169 of BPI, or alternatively, a binding site 
characterized by at least one amino acid sequence, or variant of the sequence, selected 
from positions about 36 to about 54, positions about 84 to about 109 or positions 
about 142 to about 164 of BPI. 
5 The present invention provides use of atomic coordinates of a BPI protein to 

computationally design a chemical compound for mimicking a BPI protein, or 
fragment, analog or variant thereof, or a BPI-related lipid uransfer protein, or 
fragment, analog or variant thereof, including, for example, lipopolysaccharide- 
binding protein (LBP), cholesteryl ester transferase protein (CETP) or phospholipid 

10 transfer protein (PLTP), or fragment, analog, or variant thereof. 

The present invention also provides use of atomic coordinates of BPI protein 
to design a chemical compound capable of associating with a BPI-related lipid binding 
protein, or fragment, analog or variant thereof, including, for example, 
bactericidal/permeability-increasing protein (BPI), lipopolysaccharide-binding protein 

15 (LBP), cholesteryl ester transferase protein (CETP) or phospholipid transfer protein 
. (PLTP), or fragment, analog or variant thereof. 

The present invention provides use of atomic coordinates of a BPI protein to 
design a model of ligands in an active site of a lipid binding protein, including, for 
example, BPI protein, lipopolysaccharide-binding protein (LBP), cholesteryl ester 

20 transferase protein (CETP) or phospholipid transfer protein (PLTP), or fragment, 
analog or variant thereof. 

The present invention provides use of atomic coordinates of a 
bactericidal/permeability-increasing ("BPI") protein, to design compounds with at 
least one activity selected from the group consisting of antibacterial, antifungal, 

25 antimycobacterial, antichlamydial, antiprotozoan, heparin-binding, endotoxin-binding, 
heparin-neutralizing, endotoxin-neutralizing, inhibition of tumor and endothelial cell 
proliferation, inhibition of angiogenesis, anti-inflanmiatory. anticoagulant and 
antithrombolytic. The coordinates disclosed herein are suitable for all of the 
aforementioned uses of atomic coordinates. 

30 The present invention provides a method of three-dimensional modeling of a 

BPI protein or BPI-related lipid transfer protein comprising the steps of: (a) providing 
three-dimensional atomic coordinates derived from X-ray diffraction measurements 
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of a BPI protein in a computer readable format; (b) inputting the data from step (a) 
into a computer with appropriate software programs; and (c) generating a three- 
dimensional structural representation of the BPI protein or BPI-related lipid transfer 
protein suitable for visualization and further computational manipulation; panicularly 
5 wherein the BPI protein comprises a binding site characterized by amino acid residues 
of at least one binding pocket as defined in Table 3. or wherein the BPI protein 
comprises a binding site characterized by at least one amino acid sequence, or variant 
of the sequence, selected from positions about 17 to about 45, positions about 65 to 
about 99 or positions about 142 to about 169 of BPI or wherein the BPI protein 

10 comprises a binding site characterized by amino acid residues of at least one binding 
pocket as defined in Table 3 and a binding site characterized by at least one amino 
acid sequence, or variant of the sequence, selected from positions about 36 to about 
54, positions about 84 to about 109 or positions about 142 to about 164 of BPI, or 
alternatively from about positions 36 to about 54, from about positions 84 to about 

15 109, or about positions 142 to about 164. 

The present invention provides a method for providing an atomic model of a 
BPI protein, BPI-related lipid binding protein, or fragment, analog or variant thereof, 
comprising (a) providing a computer readable medium having stored thereon atomic 
coordinate/x-ray diffraction data of a BPI protein, or firagment, analog or variant 

20 thereof, in crystalline form, the data sufficient to model the three-dimensional 
structure of the BPI protein, or fragment, analog or variant thereof; (b) analyzing, on 
a computer using at least one subroutine executed in said computer, atomic 
coordinate/x-ray diffraction data from (a) to provide atomic coordinate data output 
defining an atomic model of said BPI protein, BPI-related binding lipid protein or 

25 fragment, analog or variant thereof, said analyzing utilizing at least one computing 
algorithm selected from the group consisting of data processing and reduction, auto- 
indexing, intensity scaling, intensity merging, amplimde conversion, truncation, 
molecular replacement, molecular alignment, molecular refinement, electron density 
map calculation, electron density modification, electron map visualization, model 

30 building, rigid body refinement, positional refinement; and (c) obtaining atomic 
coordinate data defining the three-dimensional structure of at least one of said BPI 
protein, BPI-related lipid binding protein, or firagment, analog or variant thereof; 



wo 98/58961 



PCTAJS98/13007 



12 

particularly wherein said computer readable medium further has stored thereon data 
corresponding to a nucleic acid sequence or an amino acid sequence data comprising 
at least one structural domain or functional domain of a BPI, LBP, CETP or PLTP 
corresponding to at least one BPI or mutant primary sequence or fragment, analog or 
variant thereof; and wherein said analyzing step further comprises analyzing said 
sequence data. 

The present invention provides a computer-based system for providing atomic 
model data of the three-dimensional strucmre of a BPI protein, BPI-related lipid 
binding protein or fragment, analog or variant thereof, comprising the following 
elements: (a) at least one computer readable medium (CRM) having stored thereon 
atomic coordinate/ x-ray diffraction data of a BPI protein, or fragment, analog or 
variant thereof; (b) at least one computing subroutine that, when executed in a 
computer, causes the computer to analyze atomic coordinate/x-ray diffraction data 
from (a) to provide atomic coordinate data output defining an atomic model of a BPI 
protein, BPI-related lipid binding protein or fragment, analog or variant thereof, said 
analyzing utilizing at least one computing subroutine selected from the group 
consisting of data processing and reduction, auto-indexing, intensity scaling, intensity 
merging, amplitude conversion, truncation, molecular replacement, molecular 
alignment, molecular refinement, electron density map calculation, electron density 
modification, electron map visualization, model building, rigid body refinement, 
positional refinement; and (c) retrieval means for obtaining atomic coordinate output 
data substantially defining the three-dimensional structure of said BPI protein, BPI- 
related lipid binding protein or fragment, analog or variant thereof. 

The present invention provides a method for providing a computer atomic 
model of a ligand of a BPI protein, BPI-related lipid binding protein, or fragment, 
analog or variant thereof, comprising: (a) providing a computer readable medium 
(CRM) having stored thereon atomic coordinate data of a BPI protein, or fragment, 
analog, or variant thereof; (b) providing a CRM having stored thereon atomic 
coordinate data sufficient to generate atomic models of potential ligands of said BPI 
protein, BPI-related lipid binding protein, or fragment, analog, or variant thereof; (c) 
analyzing on a computer, using at least one subroutine executed in said computer, the 
atomic coordinate data from (a) and ligand data from (b), to determine binding sites 
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of BPI protein, BPI-related lipid binding protein, or fragment, analog,. or variant 
thereof, and to provide atomic coordinate data defining an atomic model of at least 
one ligand of said BPI protein, BPI-related lipid binding protein, or fragment, analog 
or variant thereof, said analyzing utilizing computing subroutines selected from the 
5 group consisting of data processing and reduction, auto-indexing, intensity scaling, 
intensity merging, amplitude conversion, truncation, molecular replacement, 
molecular alignment, molecular refinement, electron density map calculation, electron 
density modification, electron map visualization, model building, rigid body 
refinement, positional refinement; and (d) obtaining atomic coordinate model output 
10 data defining the three-dimensional structure of said at least one ligand of said BPI 
protein, BPI-related lipid binding protein, or fragment, analog, or variant thereof. 
J% The present invention provides a computer-based system for providing an 

«p atomic model of at least one ligand of a BPI protein, BPI-related lipid binding 

J protein, or fragment, analog or variant thereof, comprising the following elements: 

^ 15 (a) a computer readable medium (CRM) having stored thereon atomic coordinate data 

[f| of a BPI protein, fragment, analog or variant thereof; (b) a CRM having stored 

^ thereon atomic coordinate data sufficient to generate atomic models of potential 

Ci ligands of a BPI protein, BPI-related lipid binding protein, or fragment, analog or 

^ variant thereof; (c) at least one computing subroutine for analyzing on a computer, 

□ 20 the atomic coordinate data from (a) and (b), to determine binding sites of BPI protein, 

BPI-related lipid binding protein, or fragment, analog, or variant thereof, and to 
provide data output defining an atomic model of at least one potential ligand of BPI 
protein, BPI-related lipid binding protein, or fragment, analog, or variant thereof, 
said analyzing utilizing at least one computing subroutine selected from the group 
25 consisting of data processing and reduction, auto- indexing, intensity scaling, intensity 
merging, amplitude conversion, truncation, molecular replacement, molecular 
alignment, molecular refinement, electron density map calculation, electron density 
modification, electron map visualization, model building, rigid body refinement, 
positional refinement; and (d) retrieval means for obtaining atomic coordinate data of 
30 said at least one ligand of a BPI protein, BPI-related lipid binding protein, or 
fragment, analog or variant thereof. 
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said at least one ligand of a BPI protein, BPI-related lipid binding protein, or 
fragment, analog or variant thereof. 

Other objects of the invention will be apparent to one of ordinary skill in the 
art from the following detailed description and examples relating to the present 
invention. 

Brief Description of the Figures 
^ \ Fig. 1 (A\ A ribbon diagram of residues 1^56 of BPI illustrating its boomerang 
3^ Q yfehape. The^H2-terminal domain is shown; the COOH-terminal domain and the two 
phosphatidylcholine molecules are shown. The linker is also shown, and the disulfide 
bond is shown aka ball-and-stick model. (B) View after rotating (A) 70° about the 
long axis of the molecule. Figure produced with MOLSCRIPT [P. Krauliz, J. AppL 
Cryst., 24:926 (1991^] and RASTER3D [E. A. Merrit and M. E. P. Murphy, Acta 
Crystallogn, D50:889\(1994); D. J. Bacon and W. F. Anderson, 7. Mo. Graphics, 
6:219 (1988)]. 

Fig. 2 (A) Schematic driving of the novel BPI domain fold, shown in same 
orientation as the NHz-terminal domain in Fig. IB (B) Superposition of the NHj- and 
COOH-terminal domains of BPPshowing the overall topological similarity. Residues 
1 to 230 and 250 to 456 are shown. The NH2-terminal domain is in the same 
orientation as Fig. lA. 



15 




20 Rg. 3 Electron density of the final\2.8 A MIR map contoured at 1.0 a and 
superimposed on the refined model. TheVarea shown is in the lipid binding pocket 
of the hfHi-terminal domain of BPI. The f>hosphatidylcholine and the surrounding 
protein atoms are shown. \ 

Fig. 4 (A) The covalent structure of phosphatidylcholine and the lipid A region of 
25 LPS from £. coli and S. ryphimurium. Phosphate groups are indicated by P. 
Adapted with changes from [C. R. H. Raetz, AnnuUlev. Biochem, 59:129 (1990)]. 
(B) Slice through the interior of BPI showing the liW binding pocket in the NHj- 
terminal domain. The solvent accessible surface of theWotein was calculated without 
lipid present, the interior of die protein and the phoi|Aatidylcholine are shown. 
30 Protein residues are shown as ball-and-stick. \ 
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Figure produced with MSP\[M. L. Connolly, Science, 227:709 (1983); M. L. 
Connolly . 7. Am. Chem. Soc\l07:in8 (1985)]. 

Figs. 5 (A)\and 5 (B) The amino acid sequences of human BPI. LBP, PLTP, and 
^CETP. The\alignment was performed with CLUSTAL [D. G. Higgins and P. M. 
Sharp, Gene\ 75:237 (1989)] using all eleven known protein sequences from 
manmials [R. R, Schuman. et al.. Science, 249:1429 (1990); D. Drayna et al.. 
Nature, i27:632\(1987); R. Day et al., 7. Biol. Chem., 269:9388 (1994); S.R. Leong 
and T. Cameratk Nucleic Acids Res., 75:3052 (1990); M. Nagashima, J. W. 
McLean, R. M. iJawn, J. Lipid Res., 29:1643 (1988); M. E. Pape, E, F. Rehber, K. 
10 R. Marotti, G. W. Melchior, Anheriosclerosis 77:1759 (1991); G. Su et al., 7. 
' Immunol., 755:743\ (1994); P. W. Gray et al., 7. Biol. Chem. 264: 9505 (1989); 
Albers et al., Biomem. Biophys. Acta, 1258:21 (1995); X, C. Jiang et al.. 
Biochemistry, 34:725& (1995); L. B. Agellon et al.. Biochemistry, 29:1372 (1990); 
X. C. Jiang et al., V. Biol. Chem., 266:4631 (1991)] but only the four human 
15 sequences are shown. \ Residues that are completely conserved in all proteins are 
indicated below the sequence *; those which are highly conserved are indicated by 
•. The secondary stniccure of BPI is indicated above the sequences. The 0 strands 
are indicated by arrows; strands which make up the central 0 sheet are shown with 
gray arrows. Because of the )3 bulges and pronounced twisting, some of the 0 strands 
20 have one or more residues Vhat do not show classical H-bonding patterns or 

angles; these breaks are indicated by ' above the strands. The a helices are shown 
as cylinders, and one-residue Breaks in helices B and B' are indicated with a vertical 
dashed line. The horizontal daihed line indicates the linker region. Peptides from 
BPI and LBP with the highest LPSi-binding activity (Little, et al., 7. Biol. Chem. 268: 
25 1865 (1994); Taylor et al., 7. BidL Chem. 270: 17934 (1995)) are in bold italics. 
The disulfide bond is indicated by S^. Residues with atoms within 4 A of the NHj- 
terminal lipid are highlighted with gray shading; residues within 4 A of the COOH- 
terminal lipid are shown with white letters in black boxes. 

Fig, 6 Block diagram of a computer system 102 that can be used to implement the 
30 present invention. The computer system 102 includes a processor 106 coimected to 
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a bus 104. Also connected to the bus 104 are a main memory 108 (preferably 
implemented as random access memory, RAM) and a variety of secondary storage 
memory 110, such as a hard drive 112, a removable medium storage device 114. a 
command device 118, and a visualization device, 120, Also included is a removable 
5 storage medium 116. 

Detailed Description 
The present invention provides methods for crystallizing a BPI protein product 
where the crystals diffract x-rays with sufficiently high resolution to allow 
determination of the three-dimensional strucmre of the BPI protein product, including 

10 atomic coordinates. The three-dimensional structure (e.g, as provided on computer 
readable media as described herein) is useftil for rational drug design of BPI-related 
(and LBP-, CETP-, PLTP-related) mimetics and/or ligands. Specifically provided is 
a method for crystallizing a recombinant non-glycosylated human BPI analog 
holoprotein comprising a 4S6 amino acid sequence wherein the amino acid serine at 

15 position 351 has been changed to alanine. The three-dimensional structure is useful 
for modeling and/or synthesizing BPI-related mimetics or ligands. Such BPI-related 
mimetics or ligands are useful for treating, inhibiting or preventing BPI-modulated 
diseases. 

The present invention thus includes methods of expressing, purifying and 
20 crystallizing a BPI protein product from suitable sources, such as eukaryotic cells or 
tissues. The present invention also provides crystallized BPI protein products by 
these methods. The crystallized BPI is analyzed by x-ray diffraction techniques to 
obtain high resolution diffraction patterns and atomic coordinates that are suitable for 
molecular modeling. 

25 As used herein, '^BPI protein product" or "BPI protein" includes naturally and 

recombinantly produced BPI protein; natural, synthetic, and recombinant biologically 
active polypeptide fragments of BPI protein; biologically active polypeptide variants 
of BPI protein or fragments thereof, including hybrid fusion proteins and dimers; 
biologically active polypeptide analogs of BPI protein or fragments or variants 

30 thereof, including cysteine-substituted analogs; and BPI-derived peptides. The BPI 
protein products for therapeutic or diagnostic uses may be generated and/or isolated 
by any means known in the art. U.S. Patent No. 5,198,541, the disclosure of which 
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is incorporated herein by reference, discloses recombinant genes encoding and 
methods for expression of BPI proteins including recombinant BPI holoproiein, 
referred to as rBPI (also referred to as rBPIjs or simply rBPIjo) and recombinant 
fragments of BPI. U.S. Patent Application Ser. No. 07/885,501, now abandoned, 
5 and a continuation-in-part thereof, U.S. Patent Application Ser. No. 08/072,063, filed 
May 19, 1993, issued as U.S. Patent 5,439,807 on August 8, 1995 and corresponding 
PCT Application No. 93/04752 filed May 19, 1993, which are all incorporated herein 
by reference, disclose novel methods for the purification of recombinant BPI protein 
products expressed in and secreted from genetically transformed mammalian host cells 
10 in culture and discloses how one may produce large quantities of recombinant BPI 
products suitable for incorporation into stable, homogeneous pharmaceutical 
preparations. 

Biologically active fragments of BPI (BPI fragments) include biologically 
active molecules that have the same or similar amino acid sequence as a natural 

15 human BPI holoprotein, except that the fragment molecule lacks amino-terminal 
amino acids, internal amino acids, and/or carboxy-terminal amino acids of the 
holoprotein. Nonlimiting examples of such fragments include a N-terminal fragment 
of natural human BPI of approximately 25 kD, described in Ooi et al., 7. Exp, Med., 
174:649 (1991), and the recombinant expression product of DNA encoding N- 

20 terminal amino acids from 1 to about 193 or 199 of natural human BPI, described in 
Gazzano-Santoro et al.. Infect, Immun, 60:4754-4761 (1992), and referred to as 
rBPl23. In that publication, an expression vector was used as a source of DNA 
encoding a recombinant expression product (rBPIjs) having the 31 -residue signal 
sequence and the first 199 amino acids of the N-terminus of the mature human BPI, 

25 as set out in Figure 1 of Gray et al., supra, except that valine at position 151 is 
specified by GTG rather than GTC and residue 185 is glutamic acid (specified by 
GAG) rather than lysine (specified by AAG). Recombinant holoprotein (rBPI) has 
also been produced having the sequence (SEQ ID NOS: 1 and 2) set out in Figure I 
of Gray et al. , supra, with the exceptions noted for rBPl23 and with the exception that 

30 residue 417 is alanine (specified by GCT) rather than valine (specified by GTT). 
Other examples include dimeric forms of BPI fragments, as described in U.S. Patent 
No. 5,447,913, and corresponding PCT Application No. PCT/US95/03125, the 



wo 98/58961 



PCT/US98/13007 



18 

disclosures of which are incorporated herein by reference. Preferred dimeric 
products include dimeric BPI protein products wherein the monomers are amino- 
terminal BPI fragments having the N-terminal residues from about 1 to 175 to about 
1 to 199 of BPI holoprotein. A panicularly preferred dimeric product is the dimeric 
form of the BPI fragment having N-terrainal residues 1 through 193, designated 
rBPl42 dimer. 

Biologically active variants of BPI (BPI variants) include but are not limited 
to recombinant hybrid fusion proteins, comprising BPI holoprotein or biologically 
active fragment thereof and at least a ponion of at least one other polypeptide, and 
dimeric forms of BPI variants. Examples of such hybrid fusion proteins and dimeric 
forms are described by Theofan et al. in U.S. Patent Application Serial No. 
07/885,911, now abandoned, and a continuation-in-pan application thereof, U.S. 
Patent Application Serial No. 08/064,693 filed May 19, 1993, issued as U.S. Patent 
5,643,570 on July 1 , 1997 and corresponding PCT Application No. US93/04754 filed 
May 19. 1993, which are all incorporated herein by reference and include hybrid 
fusion proteins comprising, at the amino-terminal end, a BPI protein or a biologically 
active fragment thereof and, at the carboxy-terminal end, at least one constant domain 
of an inmiunoglobulin heavy chain or allelic variant thereof. Similarly configured 
hybrid fusion proteins involving part or all Lipopolysaccharide Binding Protein (LBP) 
are also contemplated for use in the present invention. 

Biologically active analogs of BPI (BPI analogs) include but are not limited 
to BPI protein products wherein one or more amino acid residues have been replaced 
by a different amino acid. For example, U.S. Patent No. 5,420,019 and 
corresponding PCT Application No. US94/01235, filed February 2, 1994, the 
disclosures of which are incorporated herein by reference, discloses polypeptide 
analogs of BPI and BPI fragments wherein a cysteine residue is replaced by a 
different amino acid. A preferred BPI protein product described by this application 
is the expression product of DNA encoding from amino acid 1 to approximately 193 
or 199 of the N-terminal amino acids of BPI holoprotein, but wherein the cysteine at 
residue number 132 is substituted with alanine and is designated rBPIjiAcys or rBPI^i. 
Other examples include dimeric forms of BPI analogs; e.g. U.S. Patent Application 
Serial No. 08/212,132 filed March 11, 1994, issued as U.S. Patent 5,447,913 on 
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September 5. 1995 and corresponding PCT Application No. PCT/US95/03125, the 
disclosures of which are incorporated herein by reference. 

Other BPI protein products useful according to the methods of the invention 
are peptides derived from or based on BPI produced by recombinant or synthetic 

5 means (BPI-derived peptides), such as those described in U.S. Patent Application 
Serial No. 08/504,841 filed July 20, 1995 and in PCT Application No. 
PCT/US94/ 10427 filed September 15, 1994, which corresponds to U.S. Patent 
Application Serial No. 08/306,473 filed September 15, 1994, issued as U.S. Patent 
5,652,332 on July 29, 1997, and PCT Application No. US94/02465 filed March 11, 

10 1994, which corresponds to U.S. Patent Application Serial No. 08/209,762, filed 
March 11. 1994, issued as U.S. Patent 5,733,872 on March 31, 1998, which is a 
continuation-in-part of U.S. Patent Application Serial No. 08/183,222, filed January 
14, 1994, now abandoned, which is a continuation-in-part of U.S. Patent Application 
Ser. No. 08/093,202 filed July 15, 1993, now abandoned, (for which the 

15 corresponding international application is PCT Application No. US94/02401 filed 
March 11, 1994), which is a continuation-in-part of U.S. Patent Application Ser. No. 
08/030,644 filed March 12, 1993, issued as U.S. Patent 5,348,942 on September 20, 
1994, the disclosures of all of which are incorporated herein by reference. 

Presently preferred BPI protein products include recombinantly-produced N- 

20 terminal fragments of BPI, especially those having a molecular weight of 
approximately between 21 to 25 kD such as rBPl23 or rBPIji, or dimeric forms of 
these N-terminal fragments (e.g., rBPl42 dimer). Additionally, preferred BPI protein 
products include rBPIjo and BPI-derived peptides. 

The administration of BPI protein products is preferably accomplished with 

25 a pharmaceutical composition comprising a BPI protein product and a 
pharmaceutically acceptable diluent, adjuvant, or carrier. The BPI protein product 
may be administered without or in conjunction with known surfactants, other 
chemotherapeutic agents or additional known anti-microbial agents. One 
pharmaceutical composition containing BPI protein products (e.g., rBPIso, rRBPIja) 

30 comprises the BPI protein product at a concentration of 1 mg/ml in citrate buffered 
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saline (5 or 20 mM citrate, 150 mM NaCl, pH 5.0) comprising 0.1% by weight of 
poloxamer 188 (Pluronic F-68, BASF Wyandotte, Parsippany, NJ) and 0.002% by 
weight of polysorbate 80 (Tween 80, ICI Americas Inc., Wilmington, DE). Another 
pharmaceutical composition containing BPI protein products (e.g., rBPIj,) comprises 
the BPI protein product at a concentration of 2 mg/mL in 5 mM citrate, 
150 mM NaCl, 0.2% poloxamer 188 and 0.002% polysorbate 80. Such combinations 
are described in PCT Application No. US94/01239 filed February 2, 1994, which 
corresponds to U.S. Patent Application Ser. No. 08/190,869 filed February 2, 1994, 
issued as U.S. Patent 5,488,034 on January 30, 1996, and U.S. Patent Application 
Ser. No. 08/012,360 filed February 2, 1993, now abandoned, the disclosures of all 
of which are incorporated herein by reference. Additional formulations are provided 
in U.S. Patent Application Ser. Nos, 08/372,104, filed January 13, 1995, now 
abandoned, 08/530,599, filed September 19. 1995. now abandoned, and 08/586.133. 
filed January 12. 1996 and corresponding W096/21436 (PCT/US96/01095). 

The x-ray diffraction patterns of the invention are now discovered to be of 
sufficiently high resolution to be useful for three-dimensional modeling of a BPI. 
Preferably the resolution is in the range of 1.5 to 3.5 A, preferably 1.5-3.0 A and 
more preferably ^ 2.6 A. 

Three-dimensional modeling is performed using the diffraction coordinates 
from these x-ray diffraction patterns. The coordinates are entered into one or more 
computer programs for molecular modeling, as known in the art. Such molecular 
modeling can utilize known x-ray diffraction molecular modeling algorithms or 
molecular modeling software to generate atomic coordinates corresponding to the 
three-dimensional structure of at least one BPI or a fragment thereof. 

The entry of the coordinates of the x-ray diffraction patterns and the amino 
acid sequence into such programs results in the calculation of most probable 
secondary, tertiary and quaternary strucmres of the protein, including overall atomic 
coordinates of a BPI or a fragment thereof. These strucmres are combined and 
refined by additional calculations using such programs to determine the probable or 
actual three-dimensional stnicmre of the BPI, including potential or actual active or 
binding sites of the protein. 
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Such molecular modeling (and related) programs useful for rational drug 
design of ligands or mimeiics, are also provided by the present invention. The drug 
design uses computer modeling programs which calculate how different molecules 
interact with the various sites of the BPI. This procedure determines potential ligands 

5 or mimetics of a BPI or at least one fragment thereof. The acmal BPI-ligand 
complexes or mimetics are crystallized and analyzed using x-ray diffraction. The 
diffraction pattern coordinates are similarly used to calculate the three-dimensional 
interaction of a ligand and the BPI or a mimetic, in order to confirm that the ligand 
binds to, or changes the conformation of, a panicular site on the BPI, or where the 

10 mimetic has a similar three-dimensional structure to that of a BPI or a fragment 
thereof- 

The potential ligands or mimetics are then screened for activity relating to a 
BPI. Such screening methods are selected from assays for at least one biological 
activity of the native BPI. 

15 The resulting ligands or mimetics. provided by methods of the present 

invention, are useful for treating, screening or preventing bacterial infections in 
animals, such as manunals (including humans) and birds. Mimetics or ligands of a 
particular BPI will similarly react with other BPIs from other species, subgenera or 
genera of the BPI source organism. 

20 Also provided are biologically active BPI proteins. A BPI protein is also 

provided as a crystallized protein suitable for x-ray diffraction analysis. The x-ray 
diffraction patterns obtained by the x-ray analysis are of moderately high to high 
resolution, e.g., 1.5-3.5 A. The coordinates from these diffraction patterns are 
suitable and useful for three-dimensional modeling of the crystallized protein. 

25 During the three-dimensional modeling of the BPI, these coordinates are 

entered with the BPI amino acid sequence into computer modeling programs to 
generate secondary, tertiary and quaternary structures of the BPI, as atomic 
coordinates. These structures together provide the three-dimensional strucmre of the 
BPI. The calculated and confirmed three-dimensional structure is then used for 

30 rational drug design of ligands or mimetics of the BPI or a fragment thereof. 

The determination of the three-dimensional structure of a BPI protein thus has 
a broad-based utility. Significant sequence identity and conservation of important 
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structural elements is expected to exist among the BPI proteins of a particular species, 
subgenus, genus, or family. Therefore, the three-dimensional structure from one or 
a few BPI proteins can be used to identify therapeutics with one or more of the 
biological activities of BPI (and/or those of related proteins such as LBP, CETP and 
5 PLTP). 

Determination of Protein Structures 

Different techniques give different and complementary information about 
protein structure. The primary structure is obtained by biochemical methods, either 
by direct determination of the amino acid sequence from the protein, or from the 
10 nucleotide sequence of the corresponding gene or cDNA. The quaternary structure 
of large proteins or aggregates can also be determined by electron microscopy. To 
obtain the secondary and tertiary structure, which requires detailed information about 
the arrangement of atoms within a protein, x-ray crystallography is preferred. 
=p The first prerequisite for solving the three-dimensional structure of a protein 

p 15 by x-ray crystallography is a well-ordered crystal that will diffract x-rays strongly. 

4^ The crystallographic method directs a beam of x-rays onto a regular, repeating array 

yi of many identical molecules so that the x-rays are diffracted from it in a pattern from 

which the structure of an individual molecule can be retrieved. Well-ordered crystals 
of globular protein molecules are large, spherical, or ellipsoidal objects with irregular 
^ 20 surfaces, and crystals thereof contain large holes or channels that are formed between 

O the individual molecules. These channels, which usually occupy more than half the 

^ volume of the crystal, are filled with disordered solvent molecules. The protein 

molecules are in contact with each other at only a few small regions. This is one 
reason why structures of proteins determined by x-ray crystallography are generally 
25 the same as those for the proteins in solution. 

The formation of crystals is dependent on a number of different parameters, 
including pH, temperature, protein, concentration, the nature of the solvent and 
precipitant, as well as the presence of added ions or ligands to the protein. Many 
routine crystallization experiments may be needed to screen all these parameters for 
30 the few combinations that might give crystal suitable for x-ray diffraction analysis. 
Crystallization robots can automate and speed up the work of reproducibly setting up 
large number of crystallization experiments. 
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A pure and homogeneous protein sample is important for successful 
crystallization. Proteins obtained from cloned genes in efficient expression vectors 
can be purified quickly to homogeneity in large quantities in a few purification steps. 
A protein to be crystallized is preferably at least 93-99% pure according to standard 
5 criteria of homogeneity. Crystals form when molecules are precipitated very slowly 
from supersaturated solutions. The most frequently used procedure for making 
protein crystals is the hanging-drop method, in which a drop of protein solution is 
brought very gradually to supersaturation by loss of water from the droplet to the 
larger reservoir that contains salt or polyethylene glycol solution. 

10 Different crystal forms can be more or less well-ordered and hence give 

diffraction patterns of different quality. As a general rule, the more closely the 
protein molecules pack, and consequently the less water the crystals contain, the 
better is the diffraction pattern because the molecules are better ordered in the crystal. 
X-rays are electromagnetic radiation at shon wavelengths, emitted when 

15 electrons jump from a higher to a lower energy state. In conventional sources in the 
laboratory, x-rays are produced by high- voltage tubes in which a metal plate, the 
anode, is bombarded with accelerating electrons and thereby caused to emit x-rays of 
a specific wavelength, so-called monochromatic x-rays. The high voltage rapidly 
heats up the metal plate, which therefore has to be cooled. Efficient cooling is 

20 achieved by so-called rotating anode x-ray generators, where the metal plate revolves 
during the experiment so that different parts are heated up. 

More powerful x-ray beams can be produced in synchrotron storage rings 
where electrons (or positrons) travel close to the speed of light. These particles emit 
very strong radiation at all wavelengths from short gamma rays to visible light. 

25 When used as an x-ray source, only radiation within a window of suitable 
wavelengths is channeled from the storage ring. Polychromatic x-ray beams are 
produced by having a broad window that allows through x-ray radiation with 
wavelengths of 0.2-3. SA. 

In diffraction experiments a narrow and parallel beam of x-rays is taken out 

30 from the x-ray source and directed onto the crystal to produce diffracted beams. The 
incident primary beam causes damage to both protein and solvent molecules. The 
crystal is, therefore, usually cooled to prolong its lifetime (^.g..-220 to -50^C). The 
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primary beam must strike the crystal from many different directions to produce all 
possible diffraction spots, and so the crystal is rotated in the beam during the 
experiment. 

The diffracted spots are recorded either on a film, the classical method, or by 
5 an electronic detector. The exposed film has to be measured and digitized by a 
scanning device, whereas electronic detectors feed the signals they detect directly in 
a digitized form into a computer. Electronic area detectors (an electronic film) 
significantly reduce the time required to collect and measure diffraction data. 

When the primary beam from an x-ray source strikes the crystal, some of the 

10 x-rays interact with the electrons on each atom and cause them to oscillate. The 
oscillating electrons serve as a new source of x-rays, which are emitted in almost all 
directions, referred to as scattering. When atoms (and hence their electrons) are 
arranged in a regular three-dimensional array, as in a crystal, the x-rays emitted from 
the oscillating electrons interfere with one another. In most cases, these x-rays, 

15 colliding from different directions, cancel each other out; those from certain 
directions, however, will add together to produce diffracted beams of radiation that 
can be recorded as a panem on a photographic plate or detector. 

The diffraction pattern obtained in an x-ray experiment is related to the crystal 
that caused the diffraction. X-rays that are reflected from adjacent planes travel 

20 different distances, and diffraction only occurs when the difference in distance is 
equal to the wavelength of the x-ray beam. This distance is dependent on the 
reflection angle, which is equal to the angle between the primary beam and the 
planes. 

The relationship between the reflection angle (0), the distance between the 
25 planes (d), and the wavelength (X) is given by Bragg's law: 2d sin 9 = X. This 
relation can be used to determine the size of the unit cell in the crystal. Briefly, the 
position on the film of the diffraction data relates each spot to a specific set of planes 
through the crystal. By using Bragg's law, these positions can be used to determine 
the size of the unit call. 
30 Each atom in a crystal scatters x-rays in all directions, and only those that 

positively interfere with one another, according to Bragg's law, give rise to diffracted 
beams that can be recorded as a distinct diffraction spot above background. Each 
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diffraction spot is the result of interference of all x-rays with the same diffraction 
angle emerging from all atoms. For example, for the protein crystal of myoglobin, 
each of the about 20,000 diffracted beams that have been measured contain scattered 
x-rays from each of the around 1500 atoms in the molecule. To extract information 
5 about individual atoms from such a system requires considerable computation. The 
mathematical tool that is used to handle such problems is called the Fourier 
transform. 

Each diffracted beam, which is recorded as a spot on the film, is defined by 
three propenies: the amplitude, which we can measure from the intensity of the spot; 

10 the wavelength, which is set by the x-ray source; and the phase, which is lost in x-ray 
experiments. All three properties are needed for all of the diffracted beams, in order 
to determine the position of the atoms giving rise to the diffracted beams. 

For larger molecules, protein crystallographers have determined the phases in 
many cases using a method called multiple isomorphous replacement (MIR) (including 

15 heavy metal scattering), which requires the introduction of new x-ray scatterers into 
the unit cell of the crystal. These additions are usually heavy atoms (so that they 
make a significant contribution to the diffraction pattern), such that there should not 
be too many of them (so that their positions can be located); and they should not 
change the structure of the molecule or of the crystal cell, i.e., the crystals should be 

20 isomorphous. Isomorphous replacement is usually done by diffusing different heavy- 
metal complexes into the channels of the preformed protein crystals. The protein 
molecules expose side chains (such as SH groups) into these solvent channels that are 
able to bind heavy metals. It is also possible to replace endogenous light metals in 
metalloproteins with heavier ones, e.g,, zinc by mercury, or calcium by samarium. 

25 Since such heavy metals contain many more electrons than the light atoms 

(H,N,C,0 and S) of the protein, they scatter x-rays more strongly. All diffracted 
beams would therefore increase in intensity after heavy-metal substitution if all 
interference were positive. In fact, however, some interference is negative; 
consequently, following heavy-metal substimtion. some spots measurably increase in 

30 intensity, others decrease, and many show no detectable difference. 

Phase differences between diffracted spots can be determined from intensity 
changes following heavy-metal substitution. First, the intensity differences are used 
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to deduce the positions of the heavy atoms in the crystal unit cell. Fourier 
summations of these intensity differences give maps of the vectors between the heavy 
atoms, the so-called Patterson maps. From these vector maps the atomic arrangement 
of the heavy atoms is deduced. From the positions of the heavy metals in the unit 
cell, one can calculate the amplitudes and phases of their contribution to the diffracted 
beams of protein crystals containing heavy metals. 

This knowledge is then used to find the phase of the contribution from the 
protein in the absence of the heavy-metal atoms. As both the phase and amplitude 
of the heavy metals and the amplitude of the protein alone is known, as well as the 
amplimde of the protein plus heavy metals (i.e., protein heavy-metal complex), one 
phase and three amplitudes are known. From this, the interference of the x-rays 
scattered by the heavy metals and protein can be calculated to see if it is constructive 
or destructive. The extent of positive or negative interference, with knowledge of the 
phase of the heavy metal, given an estimate of the phase of the protein. Because two 
different phase angles are determined and are equally good solutions, a second heavy- 
metal complex can be used which also gives two possible phase angles. Only one of 
these will have the same value as one of the two previous phase angles; it therefore 
represents the correct phase angle. In practice, more than two different heavy-metal 
complexes are usually made in order to give a reasonably good phase determination 
for all reflections. Each individual phase estimate contains experimental errors 
arising from errors in the measured amplitudes. Furthermore, for many reflections, 
the intensity differences are too small to measure after one particular isomorphous 
replacement, and others can be tried. 

The amplitudes and the phases of the diffraction data from the protein crystals 
are used to calculate an electron-density map of the repeating unit of the crystal. This 
map then has to be interpreted as a polypeptide chain with a particular amino acid 
sequence. The interpretation of the electron-density map is made more complex by 
several limitations of the dau. First of all, the map itself contains errors, mainly due 
to errors in the phase angles. In addition, the quality of the map depends on the 
resolution of the diffraction data, which in turn depends on how well-ordered the 
crystals are. This directly influences the image that can be produced. The resolution 



wo 98/58961 



PCT/US98/13007 



27 

is measured in A units; the smaller this number is, the higher the resolution and 
therefore the greater the amount of detail that can be seen. 

Building the initial model is a trial-and-error process. First, one has to decide 
how the polypeptide chain weaves its way through the electron-density map. The 
resulting chain trace constimtes a hypothesis, by which one tries to match the density 
of the side chains to the known sequence of the polypeptide. When a reasonable 
chain trace has finally been obtained, an initial model is built to give the best fit of 
the atoms to the electron density. Computer graphics are used both for chain tracing 
and for model building to present the data and manipulated the models. 

The initial model will contain some errors. Provided the protein crystals 
diffract to high enough resolution (e.g., better than 3.5 A), most or substantially all 
of the errors can be removed by crystallographic refinement of the model using 
computer algorithms. In this process, the model is changed to minimize the 
difference between the experimentally observed diffraction amplitudes and those 
calculated for a hypothetical crystal containing the model (instead of the real 
molecule). This difference is expressed as an R factor (residual disagreement) which 
is 0.0 for exact agreement and about 0.59 for total disagreement. 

In general, the R factor is preferably between 0.15 and 0.35 (such as less than 
about 0.24-0.28) for a well-determined protein structure. The residual difference is 
a consequence of errors and imperfections in the data. These derive from various 
sources, including slight variations in the conformation of the protein molecules, as 
well as inaccurate corrections both for the presence of solvent and for differences in 
the orientation of the microcrystals from which the crystal is built. This means that 
the final model represents an average of molecules that are slightly different both in 
conformation and orientation. 

In refined structures at high resolution, there are usually no major errors in 
the orientation of individual residues, and the estimated errors in atomic positions are 
usually around 0.1-0.2 A, provided the amino acid sequence is known. Hydrogen 
bonds, both within the protein and to bound ligands, can be identified with a high 
degree of confidence. 

Those of skill in the an understand that a set of structure coordinates 
determined by X-ray crystallography is not without standard error. For the purpose 
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of this invention, any set of structure coordinates for BPI-protein that have a root 
mean square deviation of protein backbone atoms (N, Ca, C and O) of less that 
0.75 A when superimposed — using backbone atoms - on the structure coordinates 
listed in Table 4 shall be considered identical. 
5 Most x-ray structures are determined to a resolution between 1.7 A and 3.5 

A. Electron-density maps with this resolution range are preferably interpreted by 
fitting the known amino acid sequences into regions of electron density in which 
individual atoms are not resolved. 

An amino acid sequence is preferred for accurate x-ray structure 

10 determination. Thus, recombinant DNA techniques have had a double impact on x- 
ray structural work. When a protein is cloned and overexpressed for structural 
studies, the amino acid sequence, necessary for the x-ray work, is also quickly 
obtained via the nucleotide sequence. Recombinant DNA techniques give us not only 
abundant supplies of rare proteins, but also their amino acid sequence as a bonus. 

15 Overview of BPI Purification and Crystallization Methods 

In general, a BPI protein is purified as described in Example 1. The resulting 
BPI is in sufficient purity and concentration for crystallization. The BPI is then 
isolated and assayed for biological activity and for lack of aggregation (which 
interferes with crystallization). The purified BPI preferably runs as a single band 

20 under reducing or nonreducing polyacrylamide gel electrophoresis (PAGE) 
(nonreducing is used to evaluate the presence of cysteine bridges). 

The purified BPI is preferably crystallized using the hanging drop method 
under varying conditions of at least one of the following: pH, buffer type, buffer 
concentration, salt type, polymer type, polymer concentration, other precipitating 

25 agents and concentration of purified and cleaved BPI. See, e.g,, the methods 
provided in a conunercial kit, such as CRYSTAL SCREEN (Hampton Research, 
Riverside, CA). Differently sized and shaped crystals are tested for suitability for x- 
ray diffraction. Generally, larger crystals provide better crystallography than smaller 
crystals, and thicker crystals provide better diffraction than thinner crystals. 

30 Purified BPIs 

The results of the purification are optionally analyzed by polyacrylamide gel 
electrophoresis (PAGE) under reducing or non-reducing conditions. A single band 
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is preferably obtained. With disulfide-containing BPIs, it is preferred that the analysis 
of the cleaved BPI be under non-reducing conditions to indicate whether the cleaved 
protein formed disulfide linked dimers. The amino acid sequence can also be 
determined according to known methods* or otherwise obtained, as this sequence is 
5 important in determining the three-dimensional structure of the cleaved protein (in 
combination with crystallographic analysis), as described herein, using molecular 
modeling techniques. 

Before crystallization, the purified protein is tested for one or more of the 
known biological activities of a BPI protein. 
10 It is preferred that the biological activity exceed the activity of the native 

protein. The preferred result indicates that the BPI protein retains its native strucmre, 
which is important for determining the three-dimensional crystal strucmre of the 
biologically active molecule. To identify the protease cleavage site, the purified and 
cleaved protein can be sequenced using known techniques. See, e.g.. Muni et aL, 
15 Ptoc. NaxL Acad. Sci. USA 90:1523-1525 (1993); Takimoto et aL (1992), infra. 
entirely incorporated herein by reference. 
Protein Crystallization Methods 

The hanging drop method is preferably used to crystallize the purified protein. 
See. Taylor et aL^J. MoL BioL 226:1287-1290 (1992); Takimoto et aL (1992), 
20 infra\ CRYSTAL SCREEN, Hampton Research. 

A mixture of the purified protein and precipitant can include the following: 
pH (e.g., 4-9); 

• buffer type {e. g. , phosphate, cacodylate, acetates, imidazole, Tris HCl, 
sodium HEPES); 

25 • buffer concentration (e.g., 10-200 mM); 

• salt type (e.g. , calcium chloride, sodium citrate, magnesium chloride, 
ammonium acetate, ammonium sulfate, potassium phosphate, 
magnesium acetate, zinc acetate; calcium acetate) 

• polymer type and concentration: {e.g., polyethylene glycol (PEG) 
30 1-50%, average molecular weight 200-10.000); 
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other precipitating agents (salts: K, Na tartrate, NH4SO4, NaAc, 
LiS04. NaFormate, NaCitrate, MgFormate, NaP04, KPO4 NH4PO4; 
organics: 2-propanol; non-volatile: 2-methyl-2,4-pentanediol); and 

• concentration of purified BPI (e.g., 1.0-1(X) mg/ml). 
See, e.g., CRYSTAL SCREEN, Hampton Research. 

A non-limiting example of such crystallization conditions is the following: 

• purified protein {e.g., approximately 3-4 mg/ml); 

• precipitant 10-14% Polyethylene glycol (PEG) 8000 buffered with 100 
mM cacodylate buffer and 2(X) mM of Mg acetate; 

• at an overall pH of about 3.5-8.5. 

The above mixtures are used and screened by varying at least one of pH, 
buffer type; buffer concentration, precipitating salt type or concentration, PEG type, 
PEG concentration, and protein concentration. Crystals ranging in size from 0.2-0.7 
nmi are formed in 1 -7 days. These crystals diffract x-rays to at least 3.5 A 
resolution, such as 1.5 -3.5 A, or any range of value therein, such as 1.5, 1.6, 1.7, 
1.8, 1.9, 2.0, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, or 3.0, with 3.0 A or less 
being preferred. 
Protein Crystals 

Crystals appear after 1-4 days and grow to maximal size within a week. From 
one ten crystals are observed in one drop and crystal forms can occur, such as, but 
not limited to, bipyramidal, rhomboid^ and cubic. Initial x-ray analyses indicate that 
such crystals diffract at moderately high to high resolution. When fewer crystals are 
produced in a drop, they can be much larger size, e.g., 0.4-0.9 mm. 
X-ray Crystallography Methods and Molecular Modeling 

The crystals so produced for BPI are x-ray analyzed using a suitable x-ray 
source. Diffraction patterns are obtained. Crystals are preferably stable for at least 
10 hrs in the x-ray beam. Frozen crystals (e.g., -220 to -50*C) could also be used 
for longer x-ray exposures {e.g. , 24-72 hrs), the crystals being relatively more stable 
to the x-rays in the frozen state. To collect the maximum number of useful 
reflections, multiple frames are optionally collected as the crystal is rotated in the x- 
ray beam, e.g. , for 24-72 hrs. Larger crystals ( > 0.2 mm) are preferred, to increase 
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the resolution of the x-ray diffraction. Alternatively, crystals may be analyzed using 
a synchrotron high energy x-ray source. Using frozen crystals, x-ray diffraction data 
is collected on crystals that diffract to a relatively high resolution of 3.5 A or less, 
sufficient to solve the three-dimensional structure of BPI in considerable detail, as 

5 presented herein. 

Native and/or derivative x-ray diffraction data with medium resolution is 
collected on area detectors mounted on rotating anode x-ray sources. The alternative 
program DENZO is preferably used for data processing and reduction (Sawyer et aL , 
eds.. Proceedings of CCP4 Study Weekend, pp. 56-62, SERC Darsbary Lab., UK 

10 (1993)). 

The resolution is optionally improved using larger crystals, e.g., 0.2nmi, 
making data collection more efficient, particularly for the determination of suitable 
heavy metal derivatives, such as Hg, Pt, Pb, Ba, Cd, and/or La derivatives. 

The heavy metal derivatives are used to determine the phase, e.g., by the 

15 isomorphous replacement method. Heavy atom isomorphous derivatives of BPI are 
used for x-ray crystallography, where the structure is solved using one or several 
derivatives, which, (when combined) improves the overall figure of merit. 
Derivatives are identified through Patterson maps and/or cross-phase difference 
Fourier maps, e.g. , using the CCP4 package (SERC Collaborative Computing Project 

20 No. 4, Daresbury Laboratory, UK, 1979). 

Phases were also obtained or improved by optimization of the anomalous 
dispersion component of the x-ray scattering which can break the phase ambiguity 
which a single heavy atom derivative gives. In certain cases phase information may 
be obtained without the need of a native set of data, through the use of multiple 

25 wavelength with anomalous dispersion phasing (MAD phasing). The wavelength of 
the x-rays used may be selected at a synchrotron source to optimize this anomalous 
scattering. In this case data from a derivatised crystal or crystals is collected at 
typically three wavelengths, two of which are very close to the absorption edge of the 
heavy atom scatterer. One way of obtaining a suitable heavy atom derivatised crystal 

30 is to derivatise a known ligand of the protein. 

The program MLPHARE (Wolf et aL, eds., Isomorphous Replacement and 
Anomalous Scattering: Proceedings of CCP4 Study Weekend, pp. 80-86, SERC 
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Daresbury Lab., UK (1991)) is optionally used for refinement of the heavy atom 
parameters and the phases derived from them by comparing at least one of 
completeness (%), resolution (A), W (%), heavy atom concentration (mM), soaking 
time, heavy atom sites, phasing power (acentric, centric) (See Table 1 as an 
analogous example from The Crystal Structure of diphtheria toxin, Choe et ai.. 
Nature 357: 216-222 (1992). Addition of heavy atom derivatives produce an MIR 
map with recognizable features. 

The initial phases are calculated to 3.2 A, and then improved and extended 
to a higher resolution of 2.8 A {e.g., ^3.0 A) using solvent flattening, histogram 
matching and/or Sayre's equation in the program DM (Cowtan and Main, Acta 
Crystallogr, D 49:148-157 (1993)). The skeletonization of DM procedure is 
optionally used to improve connectivity in the bulk of the protein envelope. Both the 
MIR and density modified maps are optionally used in subsequent stages, to provide 
sufficient resolution and/or modeling of surface strucmres. 

Skeletonized representations of electron density maps are then computed. 
These maps are automatically or manually edited using suitable software, e.g., the 
graphics package FRODO (Jones et al. (1991), infra) to give a continuous Ca trace. 
The BPI sequence is then aligned to the trace. Initially pieces of idealized 
polypeptide backbone were placed into regions of the electron density map with 
obvious secondary structures (e.g.. a-helix, jS-sheet). After a polyalanine model was 
constructed for the protein, amino acid sidechains were added where density was 
present in the m^s. The amino acid sequence of BPI was then examined for regions 
with distinct sidechain patterns (e.g., three consecutive aromatic rings). When a 
pattern in the sequence was found to match an area of the map, the correct sidechains 
were built onto the existing model. Eventually fi'agments containing recognizable 
sequence motifs were connected into a single chain, completing the tracing of the 
amino acid sequence into the maps. 

X-ray diffraction data {e.g. , to < 3 .0 A) was collected on an RAXIS 1 IC area 
detector (e.g., a Mar imaging plate) mounted on a RIGAKU rotating anode or 
alternatively a synchrotron x-ray source, and processed using a suitable oscillation 
data reduction program (DENZO, Sawyer et aL eds., Proceedings of CCP4 Study 
Weekend, pp. 56-62, SERC Darsbary Lab., UK (1993). Cycles of simulated 
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annealing against these data were refined using the program X-PLOR for molecular 
dynamics for R- factor refinement (X-PLOR, Briinger et aL, 7. MoL BioL 20i: 803- 
816 (1987)). This refinement was followed by manual rebuilding with FRODO using 
experimental and 2¥^-¥^ maps. The model can be optionally further refined using a 

5 least-squares refinement program, such as TNT (Tronrud et al. , Acta Crystallogr. A 
45:489-501 (1987)). 

One or more of the above modeling steps is performed to provide a molecular 
3-D model of BPI. It is preferred that the BPI model has no residues in disallowed 
regions of the Ramachandran plot, and gives a positive 3D- ID profile (Luthy et aL , 

10 Nature iJ6:83-85 (1992)), suggesting that all the residues are in acceptable 
environments (Kraulis (1991), infra). 

Multiple isomorphous replacement phase determination was used for solving 
the three-dimensional structure of BPI. This structure is then used for rational drug 
design of BPI ligands or mimetics of at least one BPI bactericidal activity, or other 

15 biological activity important in inactivating bacterial toxicity, replication and/or 
infection. 

Computer Related Embodiments 

An amino acid sequence of a BPI protein (or related protein such as LBP, 
CETP or PLTP) and/or x-ray diffraction data, useful for computer molecular 

20 modeling of BPI protein (or related protein such as LBP, CETP or PLTP) or a 
ponion thereof, can be "provided" in a variety of mediums to facilitate use thereof. 
As used herein, provided refers to a manufacture, which contains, for example, a BPI 
amino acid sequence and/or atomic coordinate/x-ray diffraction data of the present 
invention, e.g., an amino acid sequence provided in Figure 5, a representative 

25 fragment thereof, or an amino acid sequence having at least 80-100% overall identity 
to an amino acid fragment of an amino acid sequence of Figure 5 or a variant thereof. 
Such a method provides the amino acid sequence and/or x-ray diffraction data in a 
form which allows a skUled artisan to analyze and molecular model the three- 
dimensional structure of a BPI-related protein, including a subdomain thereof. 

30 In one application of this embodiment, BPI (or related protein such as LBP, 

CETP or PLTP), or at least one subdomain thereof, amino acid sequence and/or x-ray 
diffraction data of the present invention is recorded on computer readable medium. 
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As used herein, "computer readable medium" refers to any medium which can be 
read and accessed directly by a computer. Such media include, but are not limited 
to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as optical discs or CD-ROM; electrical 

5 storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of 
the presently known computer readable mediums can be used to create a manufacture 
comprising computer readable medium having recorded thereon an amino acid 
sequence and/or x-ray diffraction data of the present invention, 

10 As used herein, "recorded" refers to a process for storing information on 

computer readable medium. A skilled artisan can readily adopt any of the presently 
know methods for recording information on computer readable medium to generate 
manufactures comprising an amino acid sequence and/or atomic coordinate/x-ray 
diffraction data information of the present invention. 

IS A variety of data storage structures are available to a skilled artisan for 

creating a computer readable medium having recorded thereon an amino acid 
sequence and/or atomic coordinate/x-ray diffraction data of the present invention. 
The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs 

20 and formats can be used to store the sequence and x-ray data information of the 
present invention on computer readable medium. The sequence information can be 
represented in a word processing text file, formatted in commercially-available 
software such as WordPerfect and MICROSOFT Word, or represented in the form 
of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or 

25 the like. A skilled artisan can readily adapt any number of dataprocessor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium 
having recorded thereon the information of the present invention. 

By providing computer readable medium having stored thereon a BPI or 
related sequence protein and/or atomic coordinates based on x-ray diffraction data, 

30 a skilled artisan can routinely access the sequence and atomic coordinate or x-ray 
diffraction data to model a BPI or related protein, a subdomain thereof, mimetic, or 
a ligand thereof. Computer algorithms are publicly and conmiercially available which 
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allow a skilled artisan to access this data provided in a computer readable medium 
and analyze it for molecular modeling and/or RDD. See, e.g.. Biotechnology 
Software Directory, Mary Ann Liebert Publ., New York (1995). 

The present invention further provides systems, panicularly computer-based 
5 systems, which contain the sequence and/or diffraction data described herein. Such 
systems are designed to do structure determination and RDD for a BPI or related 
protein or at least one subdomain thereof. Non-limiting examples are microcomputer 
workstations available from Silicon Graphics Incorporated and Sun Microsystems 
running UNIX based, Windows NT or IBM OS/2 operating systems. 

10 As used herein, "a computer-based system" refers to the hardware means, 

software means, and data storage means used to analyze the sequence and/or x-ray 
diffraction data of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing unit 
(CPU), input means, output means, and data storage means. A skilled artisan can 

15 readily appreciate which of the currently available computer-based systems are 
suitable for use in the present invention. A visualization device, such as a monitor, 
is optionally provided to visualize strucmre data. 

As stated above, the computer-based systems of the present invention comprise 
a data storage means having stored therein a BPI or related protein or fragment 

20 sequence and/or atomic coordinate/x-ray diffraction data of the present invention and 
the necessary hardware means and software means for supporting and implementing 
an analysis means. As used herein, "data storage means" refers to memory which 
can store sequence or atomic coordinate/x-ray difft-action data of the present 
invention, or a memory access means which can access manufactures having recorded 

25 thereon the sequence or x-ray data of the present invention. 

As used herein, "search means" or "analysis means" refers to one or more 
programs which are implemented on the computer-based system to compare a target 
sequence or target structural motif with the sequence or x-ray data stored within the 
data storage means. Search means are used to identify fragments or regions of a BPI 

30 or related protein which match a panicular target sequence or target motif. A variety 
of known algorithms are disclosed publicly and a variety of conmiercially available 
software for conducting search means are and can be used in the computer-based 
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systems of the present invention. A skilled artisan can readily recognize that any one 
of the available algorithms or implementing software packages for conducting 
computer analyses can be adapted for use in the present computer-based systems. 

As used herein, "a target structural motif," or "target motif," refers to any 
5 rationally selected sequence or combination of sequences in which the sequence(s) are 
chosen based on a three-dimensional configuration or electron density map which is 
formed upon the folding of the target motif. There are a variety of target motifs 
known in the art. Protein target motifs include, but are not limited to, enzymic active 
sites, strucmral subdomains, epitopes, functional domains and signal sequences. A 

10 variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. 

A variety of comparing means can be used to compare a target sequence or 
target motif with the data storage means to identify structural motifs or electron 
density maps derived in part from the atomic coordinate/x-ray diffraction data. A 

15 skilled artisan can readily recognize that any one of the publicly available computer 
modeling programs can be used as the search means for the computer-based systems 
of the present invention. 

One application of this embodiment is provided in Figure 6. Figure 6 
provides a block diagram of a computer system 102 that can be used to implement 

20 the present invention. The computer system 102 includes a processor 106 connected 
to a bus 104. Also connected to the bus 104 are a main memory 108 (preferably 
implemented as random access memory, RAM) and a variety of secondary storage 
memory 110, such as a hard drive 112, a removable medium storage device 114, a 
command device 118, and a visualization device, 120. Also included is a removable 

25 storage medium 1 16. The removable medium storage device 1 14 may represent, for 
example, a floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A 
removable storage medium 116 (such as a floppy disk, a compact disk, a magnetic 
tape, etc.) containing control logic and/or data recorded therein may be inserted into 
the removable storage device 114. The computer system 102 includes appropriate 

30 software for reading the control logic and/or the data from the removable storage 
medium 116 once inserted in the removable mediimi storage device 114. 
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Amino acid, encoding nucleotide or other sequence and/or atomic 
coordinate/x-ray diffraction data of the present invention may be stored in a well 
known manner in the main memory 108, or any of the secondary storage devices 1 10, 
and/or a removable storage medium 116. Software for accessing and processing the 
5 amino acid sequence and/or atomic coordinate/x-ray diffraction data (such as search 
tools, comparing tools, etc.) reside in main memory 108 during execution. User 
commands are implemented through a conmiand device 118, such as a keyboard. 
The visualization device 120 is optionally used to visualize the structure data. 
Structure Determination 

10 One or more computational steps, computer programs and/or computer 

algorithms are used to provide a molecular 3-D model of BPI or related protein, using 
amino acid sequence data from Figure 5 (or fragments or variants thereof) and/or 
atomic coordinate/x-ray diffraction data. In x-ray crystallography, x-ray diffraction 
data and phases are combined to produce electron density maps in which the three- 

15 dimensional structure of a BPI protein is then built or modeled. MIR Phase 
determination was used for solving the three-dimensional structure of BPI. This 
structure can then be used for RDD of mimetics or ligands of a BPI or related protein 
and its associated biological activity, which is relevant to a protein modulated disease. 
Density Modification and Map Interpretation 

20 Electron density maps were calculated by X-PLOR or alternatively using such 

programs as those from the CCP4 computing package (SERC (UK) Collaborative 
Computing Project 4, Daresbury Laboratory. UK, 1979). If non-crystallographic 
synmietry axes are present, cycles of symmetry averaging can further be used, such 
as with the program RAVE (Kleywegt & Jones, Bailey et aL, eds.. First Map to 

25 Final Model, SERC Daresbury Laboratory, UK, pp 59-66 (1994)) and gradual model 
expansion. For map visualization and model building the program FRODO was used 
or alternatively, a program such as "O" (Jones (1991). infra) can be used. 

Refinement and Model Validation 

Rigid body and positional refinement can be carried out using a program such 
30 as X-PLOR (Brunger (1992), infra), e.g., with the stereochemical parameters of Engh 
and Ruber {Acta Cryst. A47:392-4O0 (1991)), If the model at this stage in the 
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averaged maps is still missing residues (e.g., at least 5-10 per subunit), some or all 
of the missing residues can be incorporated in the model during additional cycles of 
positional refinement and model building. The refinement procedure can stan using 
data from lower resolution (e.g., 25-lOA to 10-3.0 A and then be gradually extended 
5 to include higher resolution data from 12-6A to 3.0-1.5 A). B- values (also termed 
temperamre factors) for individual atoms were refined once data between 2.9 and 1 .5 
A has been added. Subsequently waters were gradually added by manual inspection 
of electron density maps. Alternatively, a program such as ARP (Lamzin and 
Wilson, Acta Cryst. 129-147 (1993)) can be used to add crystallographic waters 

10 and as a tool to check for bad areas in the model. The programs PROCHECK 
(Lackowski et aL, 7. AppL Cryst. 26:283-291 (1993)), WHATIF (Vriend, 7. MoL 
Graph. 5:52-56 (1990)), PRORLE 3D (Luthy et aL, Nature 556:83-85 (1992)), and 
ERRAT (Colovos & Y^iGs Protein Science, 2:1511-19 (1993)) as well as the 
geometrical analysis generated by X-PLOR were used to check the structure for 

15 errors. Anisotropic scaling between and Fc^ic was applied after careful assessment 
of the quality and completeness of the data. 

The program DSSP was used to assign the secondary strucmre elements 
(Kabsch and Sander, Biopoiymers, 22:2577-2637 (1983)). A program such as 
SUPPOS (from the BIOMOL crystallographic computing package) can be used for 

20 some or all of the least-squares superpositions of various models and parts of models. 
The program ALIGN (Cohen 7. MoL BioL, 190: 593-604 (1986)) was used to 
superimpose N- and C-terminal domains of BPI. Solvent accessible surfaces and 
electrostatic potentials can be calculated using such programs as GRASP (Nicholls et 
aL (1991), infixi). 

25 The structure of BPI from different organisms and the related proteins LBP, 

CETP and PLTP can thus be solved with the molecular replacement procedure such 
as by using X-PLOR (Briinger (1992), infra). A partial search model for a portion 
or all of these proteins can be constructed using the structures of BPI. The rotation 
and translation function can be used to yield orientations and positions for these 

30 models. Symmetry averaging can also be done using the RAVE program and model 
expansion can also be used to add missing residues resulting in a model with 95- 
99.9% of the total number of residues. The model can be refined in a program such 
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as X-PLOR (Briinger (1992), supra), to a suitable crystallographic Rf^„„. The model 
data is then saved on computer readable medium for use in further analysis, such as 
rational drug design. 

Rational Design of Mimetics or Ligands 

The determination of the crystal structure of a BPI protein, as described 
herein, provides a basis for the design of new and specific agents, including proteins 
or organic compounds. 

Several approaches can be taken for the use of the crystal structure of a BPI 
in the rational design of protein or organic analogs having a relevant activity similar 
to that of a BPI or related protein. A computer-assisted, manual examination of a 
BPI potential binding site structure is optionally done. The use of software such as 
GRID — Goodford, 7. Med, Chem, 28:849-857 (1985) a program that determines 
probable interaction sites between probes with various functional group characteristics 
and the protein surface — is used to analyze the surface sites to determine structures 
of similar inhibiting proteins or compounds. The GRID calculations, with suitable 
inhibiting groups on molecules {e.g., protonated primary amines) as the probe, are 
used to identify potential hotspots around accessible positions at suitable energy 
contour levels. 

A diagnostic or therapeutic BPI or related protein modulating ligand of the 
present invention can be, but is not limited to, at least one selected from a lipid, a 
nucleic acid, a compound, a protein, an element, an antibody, a saccharide, an 
isotope, a carbohydrate, an imaging agent, a lipoprotein, a glycoprotein, an enzyme, 
a detectable probe, and antibody or fragment thereof, or any combination thereof, 
which can be deiectably labeled as for labeling antibodies. Such labels include, but 
are not limited to, enzymatic labels, radioisotope or radioactive compounds or 
elements, fluorescent compounds or metals, chemiluminescent compounds and 
bio luminescent compounds. Alternatively, any other known diagnostic or therapeutic 
agent can be used in a method of the invention. Suitable compounds are then tested 
for activities of a BPI protein or BPI mimetic. 

The program DOCK (Kuntz et al. 7. MoL Biol., 767:269-288 (1982)) may be 
used to analyze an active site or ligand binding site and suggest ligands with 
complementary steric properties. Several methodologies for searching three- 
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dimensional databases to test pharmacophore hypotheses and select compounds for 
screening are available. These include the program CAVEAT (Bacon et al, 7. Mol 
BioL, 225: 849-858 (1992)) which uses databases of cyclic compounds which can act 
as "spacers" to connect any number of chemical fragments already positioned in the 
5 active site. This allows one skilled in the art to quickly generate hundreds of possible 
ways to coimect the fragments already known or suspected to be necessary for tight 
binding. The program LUDI (Bohm et al. J. Compia,-Aid, MoL Des,, 6:61-78 
(1992)) can determine a list of interactions sites into which to place both hydrogen 
bonding and hydrophobic fragments. LUDI then uses a library of approx. 600 linkers 

10 to connect up to four different interaction sites into fragments. Then smaller 
"bridging" groups such as -CH2- and -COO- are used to connect these fragments. 
For example, for the enzyme DHFR, the placements of key functional groups in the 
well-known inhibitor methotrexate were reproduced by LUDI. See also, Rotstein and 
Muxcko, y, Med. Chem,, 56:1700-1710 (1992)). 

15 After preliminary experiments are done to determine the of a ligand (e.g. , 

a lipid ligand) by BPI (or related) protein to a BPI (or related) protein, mimetic or 
fragment, the time-dependent nature of the inhibition by the BPI or related protein 
(^.g.. by the method of Henderson (Biochem. /. 727:321-333 (1972)) is determined. 
For example, a lipid ligand and a BPI mimetic are pre- incubated in buffer. 

20 Reactions are initiated by the addition of detecting substrate. Aliquots are removed 
over a suitable time course and each quenched by addition into the aliquots of suitable 
quenching solution. The concentration of product are determined by known methods 
of detection. Plots of activity against time can be close to linear over the assay 
period, and are used to obtain values for the initial velocity in the presence (V^) or 

25 absence (V^) of, for example, a BPI mimetic. Error is present in both axes in a 
Henderson plot, making it inappropriate for standard regression analysis 
(Leatherbarrow, Trends Biochem. Sci. 75:455-458 (1990)). Therefore, values are 
obtained from the data by fitting to a modified version of the Henderson equation for 
competitive inhibition: 
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where (using the notation of Henderson (Biochem, 7. 727:321-333 (1972)): 

- ■ (^) 

and 

5 

This equation is solved for the positive root with the constraint that Q = K^dA^ + K,) 
/ K^) using PROCNLIN from SAS (SAS Institute Inc., Gary, North Carolina, USA) 
which performs nonlinear regression using least-square techniques. The iterative 
method used is optionally the multivariate secant method, similar to the Gauss- 
10 Newton method, except that the derivatives in the Taylor series are estimated from 
the histogram of iterations rather than supplied analytically. A suitable convergence 
criterion is optionally used, e.g. , where there is a change in loss function of less than 
10«. 

Once modulating compounds are found, crystallographic studies of co- 
15 complexes, for example, BPI mimetics complexed to a ligand are performed. As 
used herein, a co-complex refers to a BPI protein, fragment, analog or variant thereof 
in covalent or non-covalent association with a chemical entity or compound. As a 
non-limiting example, BPI crystals are soaked for 2 days in 0.01-100 mM inhibitor 
compound and x-ray diffraction data are collected on an area detector and/or an image 
20 plate detector (e,g, , a Mar image plate detector) using a rotating anode x-ray source. 
Data are collected to as high a resolution as possible, e.g., ^3.0 A, and merged 
with a suitable /?-factor on intensities. An atomic model of the mimetic is built into 
the difference Fourier map (Fi„hibitor complex -/^native). The model can be refined to 
convergence in a cycle of simulated annealing (Briinger (1987), infin) involving 10- 
25 100 cycles of energy refinement, 100-10,000 1 -fs steps of room temperamre dynamics 
and/or 10-100 more cycles of energy refinement. Harmonic restraints may be used 
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for the atom refinement, except for atoms within a 10-15 A radius of the inhibitor. 
An /^-factor is calculated for the model as well as an r.m.s. deviation from the ideal 
bond lengths and angles. 

Direct measurements of activity provide ftinher confirmation that the modeled 
mimetic compounds are high-affinity inhibitors for the lipid ligands. Other suitable 
assays for biological activity known for BPI or related proteins may be used. 

Preferably, little or no change in the structure of the BPI or mimetic occurs 
in the electron density map described above. values are determined by a 
previously described method (Henderson (1972). infra) to evaluate mimetic proteins 
or organic compounds. 

Atomic coordinates of BP! proteins are useful in the generation of molecular 
models of related proteins and of BPI mimetics. The atomic coordinates generated 
from the solved three-dimensional structure of BPI disclosed herein may be utilized 
in combination with additional structural and/or physicochemical information, such 
as amino acid sequence data, x-ray diffraction data, combinations of x-ray diffraction 
data from multiple isomorphous replacement molecular replacement techniques, or 
other phase determination techniques. These combinations may be used to generate 
other three-dimensional coordinate data useful to generate secondary, tertiary and/or 
quaternary structures and/or domains of BPI or related proteins, including BPI-related 
lipid binding proteins, or their fragments, analogs, or variants. These alternate 
coordinate sets are useful to provide overall three-dimensional structure, as well as 
binding and or active sites of a BPI or related protein, including BPI-related lipid 
binding proteins, or their fragments, analogs, or variants. These alternate coordinate 
data sets are also useful in molecular modeling computer-based systems and methods 
for rational drug design of mimetics and ligands of BPI and other related proteins » 
including other BPI-related lipid binding proteins. Utilizing CLUSTAL (a multiple 
sequence alignment program in PC-Gene) and the Homology module (a structure- 
based homology modeling program in Insightll on a Silicon Graphics Incorporated 
workstation, molecular models (and the corresponding three-dimensional coordinates 
files) of lipopoly saccharide binding protein (LBP), cholesteryl ester transfer protein 
(CETP) and phospholipid transfer protein (PLTP) are generated. With these files, 
existing mutants are mapped and new ones designed. 
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The results described herein demonstrate that tight-binding mimetics of a BPI 
or related protein, based on the crystal structure of BPI, are provided by the present 
invention. Demonstration of clinically relevant levels of a biological activity of the 
mimetic is also useful. 
5 In evaluating mimetics for biological activity in animal models (e.g., mouse, 

rat, rabbit, baboon) various oral and parenteral routes of administration are evaluated. 
Using this approach, it is expected that a biological activity occurs in suitable animal 
models, e.g., using the mimetics discovered by structure determination and x-ray 
crystallography. 

10 Having now generally described the invention, the same will be more readily 

understood through reference to the following examples which are provided by way 
of illustration, and are not intended to be limiting of the present invention. 

Example 1: Preparation and Purification for Crystallizjation 
Construction of Plasmids Containing BPI (S351A) 

15 BPI contains a single N-linked glycosylation site at the asparagine at position 

349 which was eliminated by genetic engineering of the DNA sequence of BPI as 
follows. For glycosylation to occur at this position, the asparagine must occur within 
the sequence Asn-X-Ser/Thr where X can be any amino acid, except proline. 
N-linked glycosylation can be eliminated by either changing the Asn to another amino 

20 acid such as glutamine or by changing the serine or threonine to an alternate amino 
acid. The latter strategy was used to construct vectors containing BPI with an alanine 
at position 351 instead of serine. 
Construction of Plasmids for BPI Expression 

The plasmid pIClOS containing a cDNA encoding BPI cloned in a T3T7 

25 plasmid (Clontech, Palo Alto, CA) served as the starting point for the construction 
of a vector for expression of nonglycosylated rBPI in mammalian cells. 

To allow insertidn of BPI into an optimized manunalian expression vector, a 




^nique Xhol site was firs\ added to the 3' end of the BPI gene in pIClOS. Two 
"oligonucleotides were synth^Lced for this purpose: BPI-53 (5 ' ACT GGT TCC ATG 



30 GAG GTC AGC GCC 3 ') encoding amino acids 361 - 370 of BPI and BPI-54 (5 ' 
GAC AG A TCT CTC GAG TO^ TTT ATA GAC AA 3') encoding the last four 
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amino acids of cJ^ding sequence, the stop codon (TGA), and incorporating an Xhol 
site immediately downstream of the stop codon. These oligonucleotides were used 
to PCR amplify a 28(ySbp fragment of die C-terminus of BPI and incorporate the Xhol 
site at the 3 ' end of thV gene. The amplified fragment was digested with Ncol and 
Bg/II and ligated to a -4^00 bp NcoI-BamHI fragment from pIClOS to generate the 
plasmid pSSlOl. 

Construction of Plasmid with BPI (S351A) 

The klycosylation site was next removed by replacing the region from a unique 
Xcml site to la unique Sphl site within the BPI gene in pSSlOl with an annealed 
oligonucleotide\that contained the codon (TCC) for the serine at amino acid position 
351 changed to tlje codon (GCC) for alanine as shown below. 
Wild type 

Xcml \ Sphl 

. . . CCC AAC TCC TCC CTG GCT TCC CTC TTC CTG ATT GGC ATG CAC 
15 . . .GGG TTC AGO AOS GAC CGA AGG GAG AAG GAC TAA CCG TAC GTG 
Pro Asn Ser Ser Leu Ala Ser Leu Phe Leu lie Gly Met His 
35l\ 

Nonglycosylated 

Xcml \ - Sphl 

20 ... CCC AAC TCC GCC CTG GCT TCC CTC TTC CTG ATT GGC ATG CAC 
. . .GGG TTC AGG CGG GAC CGA AGG GAG AAG GAC TAA CCG TAC GTG 
Pro Aaa Ser Ala Le\\ Ala Ser Leu Phe Leu lie Gly Met His 
351 

This step generated the plasnHd pSS102. 

25 To construct the vector, pING4322, for the expression of full length 

nonglycosylated holo BPI in mammalian ceils, pSS102 was digested with BstEl and 
Xhol and a 596 bp fragment, which included the modified BPI sequence, was purified 
and ligated to the large BstBl-Xhol fragment from pING4147 which contains the gpt 
gene encoding resistance to mycophenolic acid, the human Ig enhancer, the human 

30 cytomegalovirus promoter (CMV) and the mouse light chain 3' untranslated region 
and is identical to the vector, pING4144 as described in U.S. Patent No. 5,420,019 
and W094/18323 (PCT/US94/01235) hereby incorporated by reference, except that 
it contains the codon for the native cysteine instead of an alanine at amino acid 
position 132 of the BPI gene. 

35 Stable Transfection of Mammalian Cells for Expression of Nonglycosylated BPI 
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Mammalian ceils are preferred hosts for production of rBPI protein analogs 
as described herein. Such cells permit proper secretion, folding, and 
post-translational modification of expressed proteins. Presently preferred mammalian 
host cells for production of BPI proteins include cells of fibroblast origin, such as 
5 CHO-Kl cells (ATCC CCL61), CHO-DG44 cells (a dihydrofolate reductase [DHFR] 
minus mutant of CHO Toronto obtained from Dr. Lawrence Chasin, Columbia 
University), CHO-DXB-11 (a DHFR' mutant of CHO-Kl obtained from Dr. 
Lawrence Chasin), Vero Cells (ATCC CRL81) and Baby Hamster Kidney (BHK) 
cells (ATCC CRL6281) and cells of lymphoid origin, such as the hybridoma 

10 Sp2/0-Agl4 (ATCC CRL1581) or the myeloma, NSO (ECACC No. 85110503). 

Transfection of mammalian cells can be accomplished by a variety of methods. 
Two of the most common approaches involve calcium phosphate precipitation of the 
expression vector DNA which is subsequently taken by the cells and electroporation, 
which causes the cells to take up the DNA through membrane pores created by the 

15 generation of a strong electric field. Selection for transfected cells is facilitated by 
the incorporation in the expression vector of a gene whose product allows the 
transfected cells to survive and grow under selective conditions. A number of such 
genes have been identified including, among others, the bacterial Tn5 neo gene, 
which encodes resistance to the antibiotic G418 and the Escherichia coli guanine 

20 phosphoribosyl transferase (gpt) gene, which encodes resistance to mycophenolic acid 
(MPA) in the presence of xanthine (Mulligan and Berg, Proc. NatL Acad, Sci. 
78:2072-2076 (1981)), the dihydrofolate reductase (DHFR) gene, which allows for 
growth of DHFR* cells in the absence of nucleosides and gene amplification in the 
presence of increasing concentrations of methotrexate, the glutamine synthetase gene, 

25 which allows for growth of glutamine auxotrophs without glutamine and gene 
amplification in the presence of methionine sulfoximine and the Salmonella 
typhimurium hisD gene and ttie E. coli trpB gene (Hartman and Mulligan, Proc. NatL 
Acad. Sci. 85: 8047-8051 (1988)), which allow growth in the presence of histidinol 
or without tryptophan (in the presence of indole), respectively. The availability of 

30 these selective markers provide significant flexibility for the generation of mammalian 
cell lines that express recombinant products, since they can be used either alone or 
in various combinations to provide cell lines with the highest possible productivity. 
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Transf action of CHO-Kl Cells with pING4322 

The CHO-Kl cell line was raaintained in Ham's F12 medium plus 10% fetal 
bovine serum (FBS). Media were supplemented with glutamine/penicillin/strepto- 
mycin (Irvine Scientific, Irvine, California). 

CHO-Kl cells were transfected by electroporation with 40 fig of pING4322 
DNA that was first digested with Pvul, phenol-chloroform extracted and ethanol 
precipitated. Following the electroporation, the cells were allowed to recover for 24 
hours in non-selective Ham*s F12 medium. The cells were then trypsinized, 
resuspended at a concentration of -5 X 10^ cells/ml in Ham*s F12 medium 
supplemented with MP A (25 ^g/mL) and xanthine (250 ^tg/mL) and plated at — 10* 
cells/well in 96 well plates. Untransfected CHO-Kl cells are unable to grow in this 
medium due to the inhibition of pyrimidine synthesis by the MPA. At -2 weeks, 
colonies consisting of transfected cells were observed in the 96 well plates. 
Supematants from wells containing single colonies were analyzed for the presence of 
BPI-reactive protein by anti-BPI ELISA using BPI23 as a standard. In this assay, 
Immulon-II 96 well plates (Dynatech) were pre-coated with affmity purified rabbit 
anti-BPl23 antiserum, followed by supernatant samples and detection was with affinity 
purified, biotinylated rabbit anti-BPIjj antiserum followed by peroxidase-labeled 
avidin. A total of 100 colonies were screened in this manner. The top isolates were 
transferred to 24 well plates and productivity was assessed as follows. Cells were 
grown to confluence in a 24 well plate in Ham's F12 medium supplemented with 10% 
FBS. Once the cells reached confluence, the Ham*s F12 medium was removed and 
2 ml of HB-CHO serum free medium (Irvine Scientific) plus 40 /xL of S-Sepharose 
beads (Pharmacia) were added. The cells were incubated for 7 days after which the 
S-Sepharose beads were removed and washed with 0. 1 M NaCl in 10 mM Tris buffer 
(pH 7.5). BPI was eluted from the beads by addition of 1.0 M NaCl in Tris buffer. 
The top producers, designated Clones 37 and 91, secreted -17 and 14 ^ig/ml, 
respectively in this assay and were frozen as Research Cell Bank numbers C2020 and 
C2021, respectively. Purified protein was prepared for crystallization studies as 
follows. 

Production and Purification of Nonglycosylated rBPL 
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The host cells used to prepare protein for crystallization studies were CHO-Kl 
cells transformed with the DNA vector pING4322 which includes DNA encoding the 
456 amino acids of human BPI preceded by its endogenous 31 residue secretory 
signal as described above. During post-translational secretory processing, the signal 
sequence residues were removed by the host cell. The desired expression product, 
nonglycosylated rBPI, was a biologically active variant of the human BPI molecule 
in which the amino acid serine at position 351 in the human BPI protein has been 
replaced with an alanine. 

Forty roller bottles were prepared which contained the transfected CHO-Kl host 
cells at 1.3 X 10^ cells per bottle in DME/F12 media supplemented widi 5% fetal 
bovine serum (FBS). The cells were grown for three days, at which time 500 ml of 
fresh media, DME/F12 with 2.5% FBS was added along with a 10 ml slurry 
(approximately 8 gr.) of sterilized S-Sepharose (Pharmacia, fast flow #17-0511-01, 
Uppsula, Sweden) and 1 ml of a 1 M solution of sodium butyrate. After two days, 
the old media plus the S-Sepharose was removed and fresh media, S-Sepharose and 
sodium butyrate were added to each roller bottle. This process of harvesting the 
expressed protein product with S-Sepharose was repeated for a total of three harvests, 
and the S-Sepharose removed during each harvest was pooled. The use of S- 
Sepharose beads to capture recombinant BPI protein products has been described in 
U.S. Patent No. 5,439,807 and WO93/23540 (PCT/US93/04752). 

The expressed nonglycosylated rBPI protein was purified from the pooled S- 
Sepharose by first removing it from the S-Sepharose resin followed by further 
purification and concentration on a series of Q-Sepharose (Pharmacia, fast flow #17- 
0510-01) and CM-Spherodex (Sepracor, #273431, Villeneuve la Garenne, France) 
columns. Following purification, the protein was buffer exchanged utilizing a 
Sephacryl S-1(K) (Pharmacia, high resolution #17-0612-01) column. 

Specifically, the pooled S-Sepharose resin from each harvest was allowed to 
settle for approximately 15 minutes. The media was removed by decanting and the 
settled resin was washed three times with approximately 400 ml of 20 mM MES, pH 
6.8, 150 mM NaCL For each wash, the buffer was added, the mbcture was stirred 
gently and the resin was allowed to resettle for approximately 15 minutes. Each 
buffer wash was removed by decanting. The beads were then washed with 400 mL 
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of 20 mM sodium acetate/acetic acid, 150 mM NaCl, pH 4.0 (acetate buffer), and 
then poured into a 2.5 x 50 cm liquid chromatography column (BIORAD, 
Econocoiumn, Richmond, California). The column was washed extensively with 
approximately 2 liters of 400 mM NaCl-acetate buffer, pH 4.0 until the A280 
5 absorbance reading of the colunm eluate was equal to that of the buffer alone. The 
column was additionally washed with approximately 600 ml of 600 mM NaCl-acetate 
buffer until the A280 absorbance of the eluate was again equal to that of buffer alone. 
The protein was then eluted from the S-Sepharose in approximately 500 ml of LO M 
NaCl-acetate buffer. 

10 The S-Sepharose eluates from each harvest were pooled and diluted to a NaCl 

concentration of 300 mM. The diluted material was then loaded on to a two column, 
serial arrangement of a 100 ml Q-Sepharose colunm connected to a 12 ml CM- 
Spherodex column. Both columns were constructed using new. sterile resin and were 
pre-equilibrated with 20 mM MES, pH 5.5, 200 mM NaCl. The Q-Sepharose 

15 coliunn served to remove any nucleic acid in the sample material, and the protein did 
not bind to this resin. After the approximate 3 liters of protein containing material 
had been loaded, the Q-Sepharose column was disconnected and the CM-Spherodex 
column was washed with buffer until the A280 absorbance of the eluate was the same 
as buffer alone. The protein was eluted from the colunm in 20 mM MES, 400 mM 

20 NaCl, pH 5.5 in a volume of approximately 180 ml. This eluted fraction was then 
reloaded on to a smaller (2 ml) CM-Spherodex column for protein concentration, and 
the bound protein removed in a single step of 20 mM MES, 1.2 M NaCl, pH 5.5 in 
a volume of approximately 12 mL. The protein was then loaded directly on to a 150 
ml pyrogen- free Sephacryl S- 100 column pre-equilibrated with 20 mM sodium citrate, 

25 150 mM NaCl, pH 5.0 buffer. Column fractions were analyzed by Coonunassie- 
stained (0.5% Coonrunassie Brilliant Blue-R, 25% isopropanol, 10% methanol, 10% 
acetic acid) SDS-PAGE and Western analysis. Western analysis was performed 
using a 1:1000 dilution of a rabbit anti-human BPI antisera. Fractions which 
contained the nonglycosylated rBPI protein were pooled and resulted in a final lot 

30 which was greater than 95% pure as analyzed by Coommassie-stained SDS-PAGE. 

The protein samples thus prepared and purified were filtered and/or 
concentrated for crystallization studies of the purified nonglycosylated rBPI protein. 
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Protein samples were optionally filtered using a 0.2 syringe filter (Millipore 
Corp., Bedford, MA) or a 0.2 fim Nalgene filter (Nalge Corp., Rochester, NY) to 
remove precipitate. Protein samples were concentrated in a Centricon 10 (Amicon 
Corp., Beverly, MA) or a Centriprep 10 (Amicon Corp., Beverly, MA). For the 

5 Centricon 10 concentrators, a JA 20 rotor (Beckman, Fullerton, CA) in a J2-21 
Beckman centrifuge was used at 6000 rpm for 60 minutes. For the Centriprep 10 
concentrators, a swinging bucket rotor in a J-6B Beckman centrifuge was used at 
3000 rpm for 60 minutes. Final volumes for various protein samples prepared for 
crystallization studies described herein ranged from about 0.1 to 1 mL, and the 

10 protein concentrations were generally between about 10 and 20 mg/mL. Protein 
solutions may be diluted or concentrated for crystallization studies, for example, from 
about 5 to about 50 mg/ml. 

Example 2: Structure Determination of a Crystallized BPI 
Protein 

15 Presented herein is the crystal structure of BPI and two bound phospholipids 

at 2.4 A resolution. Our model provides the first structural information on the LPS- 
binding and lipid transport protein family and suggests a common mode of lipid 
binding for its members. 

Purified, full-length, non-glycosylated, recombinant human BPI expressed in 

20 CHO cells was crystallized by hanging-drop vapor diffusion at room temperature. 
The protein concentration was 8.5 mg/ml and the crystallization buffer contained 12% 
(w/v) PEG 8000, 200 mM magnesium acetate, and 100 mM sodium cacodylate, pH 
6.8. Two crystal forms with slightly different cell dimensions grew under the same 
conditions in space group Cj, with one molecule per asymmetric unit. Form 1 

25 crystals were reproducible and had cell dimensions of a = 185.0, i>=37.2, c=84.3 A, 
and )3 = 101.3*. Form 2 crystals appeared rarely and had cell dimensions of 
a=185.6, i?=33.0, c=85,2 A. and j3 = 101.6' 

For Table 1, x-ray diffraction data were collected at room temperamre with 
the R-AXIS lie imaging plate area detector mounted on a Rigaku RU200 rotating 

30 anode x-ray generator. Data were processed with DENZO and SCALEPACK 
[Z. Otwinski, in Proceedings of CCP4 Study Weekend: Data Collection and 
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Processing, L. Sawyer, N. Isaacs, S. Baileys, Eds. (SERC Daresbury Laboratory, 
Warrington, UK, 1993), pp. 56]. For form 1 crystals, a native data set to 2.8 A was 
collected from a single crystal, which was 92.4% complete overall (84.9% complete 
with an average I/a(I)=2.3 in the outermost resolution shell). A native data set to 
2.4 A for form 2 was collected from two crystals and was 92.7% complete overall 
(94.6% complete with an average I/a(I)=2.6 in the outermost shell). Because they 
could be reliably reproduced, form 1 crystals were used for all heavy atom soaks. 
The strucmre was solved by multiple isomorphous replacement (MIR) with anomalous 
scattering. Heavy atom sites were identified by difference-Patterson and difference- 
Fourier maps. Phase refinement was performed with [Collaborative Computational 
Project No. 4, Acta Crystallogr., D50:160 (1994)] producing a mean figure of merit 
(FOM) of 0.57. The MIR map (Fig. 3) was improved by density modification 
including solvent flanening, histogram matching, and phase extension using DM 
[Collaborative Computational Project No. 4, supra]. After a partial model was 
obtained with FRODO [T. A. Jones, J. AppL Crystallogr., /7:268 (1978)], phase 
combination was performed with SIGMA A [Collaborative Computational Project No. 
4, supra] (final FOM =0.89). CMNP is chloro-Hg-nitrophenol; DMM is dimethyl 
mercury; PCMBS is parachloromercury-benzene sulfonate; TELA is triethyl lead 
acetate. 
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Table 2 relates to model refinement and statistics. The model was refined at 
2.8 A through iterative cycles of simulated-annealing with X-PLOR [A. T. Briinger 
and A. Knikowski. Acta Crystallogr., A46:5%5 (1990)] and manual rebuilding. 10% 
of the data were set aside before refinement began for R^^ [A. T. Bninger Nature, 
355:412 (1992)] calculations. When the model had been refined to an R- factor* of 
20.4% (Rf^=32.6%) with the 2.8 A data, rigid-body minimization was performed 
against the 2.4 A data set (R=29.8% to 3.5 A after minimization). Additional cycles 
of simulated annealing, positional refinement, correlated individual temperature factor 
refinement, and manual rebuilding reduced the R- factor to 22.7% and Rfree=31.3% 
(no intensity cutoff). An overall anisotropic temperature factor and bulk solvent 
correction were applied to the observed reflections when R^^ showed improvement. 
The model was confirmed by calculating simulated-annealing omit-maps for every 
part of the structure. The final model contains all 456 residues of the protein, 48 
well-ordered waters, and 2 molecules of phosphatidylcholme. Regions of the 
backbone with poor electron density include residues 148, 232-236, 258-260, and 
parts of the loop between residues 281-311. Sidechains with poorly defined density 
were truncated to alanine. The model was examined by the programs PROCHECK 
[R. A. Laskowski, M. W. McArthur, D. S. Moss, J. M. Thornton, /. AppL 
Crystallogr, 26:283 (1993)]. VJERIFY [R. Liithy, J. U. Bowie, D, Eisenberg, Nature, 
556:83 (1992)], and ERRAT [C. Colovos and T. Yeates, Protein ScL, 2:1511 
(1993)]. 
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TABLE 2: REFINEMENT STATISTICS FOR FORM 2 CRYSTALS 

Data 



Resolution (A) 




TJniaue reflections 


lo,898 


Comnleteness ( %^ 




Atoms in model 




Protein ( T^on-hvHmopTl^ 




PhoQnhsitiHvlrhnlinp 


1U2 


Water 


4o 


Refinement narameters 




Resolution ranpe (A\ 


DiJ.U-2.4 


R-factor* (%) 


22.7 


Rfr.e(%) 


31.3 


Avg. atomic B factors 




Protein 


36.9 


Lipid N. C 


49.4, 51. Ot 


Waters 


44.6 


rmst deviation from ideality 




Bonds (A) 


0.006 


Angles (deg) 


1.4 


Dihedrals (deg) 


26.0 


Improp)ers (deg) 


1.2 



*R = ! 00 (X I F^- I )/(! F^) where F^ and F^ are the observed and calculated structure factors, respectively. 
tB factors for the lipids bound in the NHj- and COOH -terminal domains. 
25 trms"rooi mean square 



BPI is a boomerang-shaped molecule with approximate dimensions of 135 by 
35 by 35 A (Fig. 1, A and B). It consists of two domains of similar size (NHj- and 
COOH-terminal) that are connected by a proline-rich linker of 21 residues (positions 
230 to 250). The two domains form three structural units; barrels are found at each 

30 end of the protein, and a central 0 sheet forms an interface between the barrels. The 
secondary structure and topology of the two domains are similar, giving the protein 
pseudo-twofold synmietry. 

Each barrel (residues 10 to 193 and 260 to 421) contains three common 
structural elements: a short oc helix, a five-stranded antiparallel 0 sheet, and a long 

35 helix (Fig. 2A), in that order. We call these elements helix A, sheet N and helix B 
in the NH2-ierminal domain, and helix A', sheet C and helix B' in the COOH- 
terminal domain. Sheets N and C have a series of /9 bulges that change the direction 
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of their strands and cause a pronounced curve in the sheets. In each domain, the long 
helix lies along the concave face of the sheet, with the helical axis at -60* to the 
strands of the 0 sheet. A single disulfide bond between Cys*^^ and Cys'^^ anchors 
helix B to the final strand of sheet N. Situated between the NHj- and COOH-terminal 
5 barrels is a twisted, seven-stranded antiparallel 0 sheet composed of four strands from 
the NHj-terminal domain and three strands from the COOH-terminal domain. This 
central sheet forms an interface between the two domains and is thus reminiscent of 
several dimer interfaces stabilized by hydrogen bonds between strands of a /3 sheet 
[M. Leeson, B. Henderson, J. Gillig, J. Schwab and J. Smith, Structure, 4:253 

10 (1996); D. Ohlendorf, W. F. Anderson, M. Lewis, C. O. Pabo, B. W. Matthews. 
/. MoL BioL, 169:151 (1983); G. N. Reeke, J. W. Becker, G. M. Edelman, 7. BioL 
Chem.. 250:1515 (1975)]. 

The structural similarity of the two domains of BPI is shown by the 
superposition [G. H. Cohen, 7. MoL BioL, 790:593 (1986)] in Fig. 2B; they are 

15 related by a rotation of 173' and have a root mean square deviation (rmsd) of 3 .0 A 
on the basis of superposition of 169 Ca pairs. The structure shared by these two 
domains does not resemble other protein folds; several structural alignment programs 
[N. N. Alexandrov and D. Fischer, Proteins, 25:354 (1996); D. Fischer, C. J. Tsai, 
R. Nussinov, Protein Eng., 5:981 (1995); L. Holm, C. Sander, NucL Acids Res,, 

20 22:3600 (1996)] failed to reveal a significant match to any known folds. Significant 
differences between the superimposed domains are found in two loop regions 
containing residues 45 and 96 in the NHj-terminal domain and residues 280 and 348 
in the COOH-terminal domain. These differences may be functionally important 
because the loops around residues 45 and 96 in the NHj-terminal domain have been 

25 implicated in LPS binding and bactericidal activity (see below). This structural 
similarity of the two domains was unexpected, not only because of their lack of 
significant sequence identity ( < 20%), but also because of their functional differences. 
The NHj- terminal domain of BPI is cationic and retains the bactericidal, LPS-binding, 
and LPS-neutralizaiion activities of the intact protein [A. H. Horwitz, et al.. Protein 

30 Expr. and Purif., 5:28 (1996); C. E. Ooi, J. Weiss, P. Elsbach, B. Frangione and 
B. Mannion, /. BioL Chem., 262:14891-14894 (1987); C. E. Ooi, J. Weiss, M. E. 
Doerfler and P. Elsbach, 7. Exp, Med., 174:649 (1991)]. The COOH-terminal 
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domain is essentially neutral and shows limited LPS-neutralization activity [S. L. 
Abrahamson et al., 7. Biol. Chem., 272:2149 (1997)]. However, the strucniral 
similarity of the two domains may reflect a previously undetected functional 
similarity: each domain contains a binding pocket for a phospholipid, 
5 After the amino acid sequence had been traced in the electron density maps, 

two regions of extended electron density remained that could not be accounted for by 
protein atoms. This density, found in the interior of both domains, was present in 
the multiple isomorphous replacement (MIR) maps (Fig. 3) at an intensity similar to 
that of the protein density, and it became the predominant feature in F^^-F^j^ maps 

10 after sequence fitting (both form 1 and form 2 crystals). Electrospray mass 
spectrometry of the sample used for crystallization revealed two molecules, with 
relative molecular masses of 522 and 787, in approximately equal amounts. Tandem 
mass spectrometric analysis was consistent with the two species being 
phosphoglycerides containing a phosphatidylcholine head group and either one or two 

15 18-carbon acyl chains with one double bond. Phosphatidylcholine (Fig. 4A) is 
abundant in eukaryotic cells and is presumably bound by BPI in the cells from which 
the protein is isolated. 

The two lipids are bound in extensive apolar pockets on the concave surface 
of the boomerang, situated between the NHj-terminal and COOH-terminal barrels and 

20 the central jS sheet. In the NHj-terminal domain, the entrance to the pocket is formed 
by helices A and B. The back and sides are formed by sheet N and the central sheet. 
The two acyl chains insert — 15 A into the interior of the protein and are surrounded 
by apolar side chains (Fig. 4B). The head group lies at the entrance of the pocket 
and is exposed to solvent. The pocket in the COOH-terminal domain, which has a 

25 slightly larger opening, is formed by the analogous secondary structures. Both basic 
and acidic side chains found near the entrances of the pockets are available for 
electrostatic interactions with the zwitterionic head group. When the lipids are 
removed from the model, the pocket in NHj- terminal domain has a solvent accessible 
surface area [M. L. Connolly. Science, 221.109 (1983); M. L. Connolly, J. Am. 

30 Chem. Soc, 707:1118 (1985)] of 557 A^ and the pocket in the COOH-terminal 
domain has an area of 413 A\ for a total of 970 A^ The intensity of the electron 
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density for the two acyl chains in both pockets is similar and does not indicate 
whether the single acyl chain species is found predominantly in either pocket. 

The discovery of bound phospholipid in our structure suggests a possible site 
of interaction between BPI and LPS. As seen in Fig, 4A, phosphatidylcholine and 

5 LPS share some structural similarity, including negatively charged phosphate groups 
and, most notably, acyl chains. Since BPI's function is to bind a lipid, LPS, and 
since lipid is bound in pockets of BPI, it seems reasonable that the acyl chains of LPS 
bind in the apolar pockets. The following observations support his hypothesis: i) the 
acyl chains of lipid A are known to be essential for binding by BPI [H. Gazzano- 

0 Santoro et al.. Infection and Immunity^ (5i:2201 (1995)]; ii) the binding pockets of 
BPI are reminiscent of cavities in other lipid-binding proteins [L. Banaszak et aL, 
Adv, Protein Chem., 45:89 (1994)]; and iii) BPI has a significant sequence similarity 
to two lipid transfer proteins (see below). 

Our proposed site of interaction between BPI and the acyl chains of LPS 

5 differs from that suggested by previous work focusing on the NH2-terminal domain. 
Fragments containing the NHj-terminal domain of BPI have been identified with 
equivalent or greater bactericidal and LPS-binding activities relative to the full-length 
protein [A. H. Horwitz, et al.. Protein Expression and Purification, 8:28 (1996); C. 
E. Ooi, J. Weiss, M, E. Doerfler and P. Elsbach, 7. Exp. Med., 774:649 (1991)]. 

3 The activity of one NHj-terminal fragment was reduced when residues past positions 
12 or between positions 169 and 199 were deleted [C. Capodici and J. Weiss, J. 
Immunol., 756:4789 (1996)]. The structure shows that these deletions affect elements 
of the barrel (at the beginning of helix A and from the middle to the end of helix B) 
and could significantly alter its structure. While the barrel seems to be the minimal 

5 structural unit with full activity, three smaller regions of this domain retain significant 
LPS-binding, LPS-neutralization, and bactericidal activity [R. G. Little, D. N. 
Kelner, E. Lim, D. J. Burke and P. J, Conlon. 7. Biol. Chem., 265:1865 (1994)]: 
residues 17-45 (most of helix A and the first /3 strand of sheet N), residues 82-108 
(a )3 hairpin [residues 82 to 106 of BPI show limited sequence similarity with residues 

) 32 to 51 of the limulus anti-LPS factor (LALF), and have been predicted to form an 
amphipathic 0 hairpin similar to that seen in the LALF structure [A. Hoess, S. 
Watson, G. R. Silber and R. Liddington, EMBO 72:3351 (1993)]. Although this 
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region of BPI does form a 0 hairpin, the strict amphipathic character of the loop seen 
in LALF is not maintained, and a structural superposition shows that the sequence of 
BPI must be shifted by one residue relative to the proposed sequence alignment] 
between strands 3 and 4 of sheet N), and residues 142-169 (a segment preceding and 

5 including part of helix B). These three regions include 18 basic residues (and only 
4 acidic residues) and form a positively charged tip on the NHj-terminal domain (on 
the left of Fig. 1) which may make favorable electrostatic interactions with negatively 
charged groups of LPS. Further studies are necessary to determine the relative 
importance of the apolar pockets and positively charged NHz-terminal tip to BPI's 

0 LPS-binding and bactericidal activities. 

BPI is the first member of the manunalian LPS-binding and lipid transfer 
family to have its three-dimensional structure determined. BPI and LBP are related 
to two lipid transfer proteins, cholesteryl ester transfer protein (CETP) and 
phospholipid transfer protein (PLTP) [A. Tall, Annu. Rev. Biochem., 64:235 (1995)]. 

5 Alignment of the amino acid sequences of human BPI. LBP. CETP, and PLTP with 
EPFs secondary structure (Fig. 5, A and B) shows that strucmrally important residues 
are conserved in the four proteins. The two cysteines that form the single disulfide 
bond and are critical to the function of BPI [A. H. Horwitz, et al.. Protein Expression 
and Purification, S:28 (1996)] are completely conserved. Also, the pattern of 

3 hydrophobic/hydrophilic residues in the 0 strands indicates that the 0 bulges 
responsible for the extensive sheet twisting are preserved. The conserved sequences 
strongly suggest that members of the LPS-binding and lipid transfer family share 
EPFs two-domain structure and that the two domains are similar in topology. 

It is likely that the lipid transfer proteins will also share the apolar binding 

5 pockets found in BPI. Striking parallels are found between our BPI- 
phosphatidylcholine structure and previous work showing that CETP copurifies with 
an equimolar amount of phosphatidylcholine [A. Tall, Anna. Rev. Biochem., 64:235 
(1995)] and has two distinct binding sites [S. Wang, L. Deng, R. W. Milne and A. 
R. Tall, J. BioL Chem., 267:11467 (1992)] - one for neutral lipids and another for 

) phospholipids. The known ligands of CETP and PLTP (cholesteryl esters, 
triglycerides, retinyl esters and phospholipids) all contain at least one acyl chain 
which could .bind in apolar pockets similar to those in BPI, suggesting a conunon 
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mode of ligand binding in this family. Sequestration of these hydrophobic chains in 
interior pockets may be critical to the function of the lipid transfer proteins: transfer 
of apolar ligands in an aqueous environment. Thus, the structure of BPI illuminates 
the action of the plasma lipid transfer proteins, as well as offering possibilities for 
5 how BPI and LBP interact with LPS. 



Example 3: Molecular Modeling of BPI Ligands and Mimetics 
We have used the information derived from the X-ray crystal structure of BPI 
presented herein, along with the teachings of the art, including, for example, 
WO94/20532 (PCT/US94/02465) to design various BPI-related proteins and peptides. 
These constructs may be divided into categories as illustrated below, including 
peptides and proteins, including fragments, analogs and variants of the protein, since 
they best describe the different ways in which different domains and portions may be 
assembled to achieve new molecules. 

1 . Individual Peptide Domains: The overlapping BPI peptide data indicated that 
the N-ierminal domain of BPI contains at least three independent functional domains 
that have one or more of the biological activities of BPI, including, for example, 
antibacterial, antifungal, ami-heparin and anti-angiogenic activities. Domain I is a 
region of amino acid residues from about 17 to about 45; Domain II is a region of 
amino acid residues from about 65 to about 99; and Domain III is a region of amino 
acid residues from about 142 to about 169. Hundreds of peptide sequences derived 
from these domains have been synthesized, including addition, deletion and 
substitution variants of the domain-derived sequences. Through further refinements, 
smaller "core" regions within these domains have been identified that still retain high 
levels of activity; for example, within Domain II residues 90-99 and within Domain 
III residues 148-161. 

These peptides have included linear molecules that may or may not assume a 
conformation that maximally express activity. From the X-ray structure data, 
segments of BPI are designed that should preserve the three-dimensional structure of 
these domains when constructed outside the context of the intact protein. For 
example, both Domains I and II contain hairpin loop structures that are positioned 
adjacent to one another in space on the proximal tip of the molecule. In contrast. 
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although Domain III is a helix + turn and not a loop, extensions from both ends of 
the domain are positioned near enough to each other to consider linking them 
together. As a result, peptides can be designed that reflect these structures by 
replacing selected residues in synthetic or recombinant peptides with cysteines, so as 
5 to create disulfide-stablized domain mimetics. Since this approach is based upon the 
actual structure of BPI, it differs from those of other groups that are based upon 
putative alignments between BPI and the structure of proteins such as the Limulus 
amoebocyte lysate factor (LALF). As examples of these embodiments, listed below 
are a series of exemplary peptides that, with cysteines added to the N- and C-terminus 
10 could assume structures similar to those seen in the intact protein: from Domain I: 
residues 36-54; from Domain II: residues 84-109, 85-108, 86-107, 87-106, 88-105, 
89-104, or 90-103; and from Domain III: residues 142-164. 

2. Peptide Domain Hybrids: It has also been demonstrated that certain peptide 
domain hybrids, which include repeats of the same sequence from a single domain 

15 or inter-domain combination of sequences, have enhanced activities. For example, 
linking Domain II- and Ill-derived peptides (such as XMP.29: 85-99:: 148-161) has 
enhanced biological activity. Interestingly, these domains in the crystal structure are 
closely associated in space, and peptide XMP.29 may represent a Domain II::III 
hybrid that actually shares some structural similarity to the intact protein. Based upon 
20 the X-ray structure, a Domain II-III peptide that consists of approximately residues 
D 90- 103:: 146- 162 is constructed . Such a peptide may even more closely mimic what 

^ is seen in the protein. 

3. BPI "Tip" Mimetics: As discussed above, portions of all three peptide 
domains discovered and described in WO94/20532 come together on the proximal tip 

25 of the N-terminal fragment. As a result, a BPI "tip" mimetic is designed that 
essentially "slices" off the most extreme tip of the molecule but preserves the critical 
domain elements. Such a slice would lack the hydrophobic pockets found in the 
intact protein, but may exhibit activities beyond those of the individual segments. 
The following segments represent such a "slice" of the three peptide domains. 

30 However, to best preserve the geometry between the domains, it would be desirable 
to insert "linker" sequences between them so as to ensure proper positioning. By 
fixing these segments in space, programs such as Insightll (Molecular Simulations, 
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Inc.) can identify possible linker sequences by i) searching protein databases for 
similar structures or ii) de novo designing appropriate linkers. In this regard ii may 
be desirable to utilize residues that are not readily susceptible to proteolysis (Ala, Ser, 
Gly, etc.), or to utilize amino acids like Pro that impose additional spatial constraints 
5 on peptide structure. An exemplary peptide consists of Domain I-II-III-derived 
elements: 37-54:90-104:144-162. 

Similarly, by analogy with the above-described cyclic domain structures, the 
fact that residues 37 and 162 are positioned near each other in the protein suggests 
that a cyclic tip mimetic could be created by replacing these residues with cysteines, 
10 for example. Cys::38-54:90-104: 144.161::Cys. 

4. Extended N-terminal Domains of BPI: The three dimensional structure of 
BPI indicates that the molecule, which forms N- and C-terminal domains, can be 
divided into three structural units as described in Example 2. Two of these units 
represent the N- and C-terminal "barrels" that are formed by residues 10-193 and 

15 260-421, respectively, whereas the third element is a central B sheet strucnire that 
forms the interface between the two barrels. Of interest is the fact that the two bound 
phospholipids in BPI occupy spaces between the two barrel structures and the central 
6 sheet. As a result, the recombinant BPI protein product rBPIji. which essentially 
contains residues 1-193 lacks some of the structural components necessary to form 

20 a complete hydrophobic pocket. A new molecule is constructed that encodes residues 
1 to approximately 260 which would contain most of the residues necessary to form 
a complete hydrophobic pocket. 

5. Mutants for Inunobilizing rBFIj,: One application for BPI protein products 
is their use as affinity removal ligands for endotoxin in solution. For example, 

25 immobilizing a BPI protein on a column or membrane matrix would allow removal 
of endotoxin from endotoxin-contaminated solutions by simply passing those solutions 
over the immobilized BPI protein. Some of the cysteine-mutated peptides described 
above may be useful for this purpose, as well as rBPlja- Alternatively, in order to 
selectively couple a stable, readily produced protein like rBPl2i to a column or 

30 membrane, a cysteine could be added to the N- or C-terminus, thus allowing site- 
specific conjugation and selective orientation of the binding "tip" away from the solid 
support. Such a construct is alternatively constructed by adding a shon linker 
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segment (such as Gly-Gly-Gly-Ser) to the C-terminal residue of the BPI protein 
product, for example, residue 193 of rBPI(l-193), followed by a cysteine residue. 
Such a construct would have a high probability of folding correctly, given the domain 
namre of residues 1-193. and be readily conjugatable. Similarly, a series of new 
5 conjugates between rBPI(l-193)C and other thiol-containing proteins or molecules is 
created for the purposes of evaluating new molecules. 

6. New N-terminal Dimeric Molecules: As an extension of the above analysis, 
a new series of N-terminal dimeric molecules can be constructed that take better 
advantage of the hydrophobic pockets. For example, by replacing the C-terminal 

10 barrel with another copy of the N-terminal barrel, an analog of BPI would be created 
that contains two functional barrels and possibly two functional hydrophobic pockets. 
One such dimer could be constructed by replacing residues 260-456 with residues 1- 
193. Alternatively, other more central locations may be identified within the 6 sheet 
structure where synunetry would dictate additional and even better points for 

IS duplication. 

7. C-lerminal Fusion Proteins: The C-terminal domains of LBP and GET? 
appear to mediate interactions with CD 14 and lipoproteins. Similarly, the C-terminal 
domain of BPI has LPS binding and neutralization activities. As a consequence, the 
C-terminal barrel of BPI (or other family members) could be fused to barrels or 

20 domains of other family members and/or to other proteins to alter/modify/enhance 
their action. 

8. Homology modeling of BPI family members: The BPI coordinates have 
been useful in the generation of molecular models of other members of the BPI 
protein family. Utilizing CLUSTAL (a multiple sequence alignment program in PC- 
IS Gene) and the Homology module (a structure-based homology modeling program in 

Insightll on the SGI), molecular models (and the corresponding three-dimensional 
coordinates files) of lipopoly saccharide binding protein (LBP), cholesteryl ester 
transfer protein (CETP) and phospholipid transfer protein (PLTP) have been 
generated. With these files, existing mutants are mapped and new ones designed. 
30 Published data [Wang et al.. Biochemistry 30:3484-3490, (1991)] indicate that 
insertional alterations in three locations of CETP severely impaired cholesteryl ester 
transfer activity: residues 48-53, residue 165, and residues 373-379. Since residues 
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48-53 and residue 165 of CETP coincide structurally with Domains 1 and III of BPl 
respectively, the functional domain structure of BPI extends to the other protein 
family members. Similarly, by virtue of the symmetry between the N- and C- 
terminal domains, it is likely that the corresponding residues on the C-terminal tip of 
BPI are involved in recognizing receptors and/or interacting with lipoproteins. 
9. Lipid Pocket Mutants: A deuiled compilation of the residues in BPI 
which form the pockets is described in Table 3 as follows. Column 1 of Table 3 
indicates the residue name and number. Column 2 shows checked residues which 
indicate the residues that show a change in solvent accessible surface area with lipid 
binding. Column 3 shows checked residues that indicate the residues that have some 
atom within 4 A of a lipid atom. If the contact is to the head-group of the lipid, the 
residues are listed at the end. under ENTRY ONLY. Column 4 indicates 
conservation in 3 BPI and 4 LBP sequences, e.g. , for He 68, in 3 of the 7 sequences, 
the residue is similarly He; for the other 4 sequences, the residue is Leu (see also 
column 5). Column 5 indicates alternative residues which occur in BPI or LBP at 
that position for the 7 BPI and LBP sequences analyzed. Column 6 indicates residues 
for mutations to block the pockets, using residues selected to be well-conserved 
(especially in the N-terminal domain) and relatively small. The suggested mutations 
are all to large sidechains in order to decrease the size of the pocket by as much as 
possible. 



TABLE 3 



N-TE 


RMINAL POCKET - residues contributing to interior 


Residue' 


ASA^ 


Within 
4A^ 


Conserva- 
tion* 


Alterna- 
tives* 


Mutations* 


Val 7 


V 


V 


2/7 


A.T 




He 9 


V 


V 


All 




W 


Gly 13 


V 


V 


All 






Leu 14 


V 




All 






Tyr 16 


V 




All 






Ala 17 


V 


V 


All 




F 



25 
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Ser 18 


V 


V 


1/7 


C.A 






Gly 21 


V 


V 


All 




F 




Ala 24 


V 


V 


4/7 


V.T.S 


Y.H 




Leu 25 


V 




All 






5 


He 68 


V 


V 


3/7 


L 






Leu 76 


V 


V 


AH 








Phe 78 


V 


V 


1/7 


V.L 






Leu 117 


V 


V 


All 








Leu 119 


V 




AH 






10 


Pro 128 


V 




5/7 


A,S 






He 130 


V 




1/7 


V 






Val 178 


V 




2/7 


L,l 






Val 182 


V 




All 








GIu 185 


V 


V 


1/7 (allele) 


K,D.H 




15 


Leu 186 


V 


V 


All 




W 




Tyr 189 


V 


V 


All 








Phe 190 


V 


V 


2/7 


V,L 






Leu 193 


V 


V 


All 








Leu 220 


V 


V 


All 






20 


Val 222 


V 




5/7 


M,W 






Val 254 


V 




6/7 


I 






Leu 256 


V 




3/7 


F 






Pro 428 


V 


V 


All 








Thr429 


V 


V 


1/7 


M.L 




25 


Pro 430 


V 


V 


5/7 


L 






Val 433 






3/7 


I 






Leu 435 


V 




All 








Val 453 


V 




6/7 


I 






ENTRY ONLY 


30 


Gin 20 


V 




3/7 


E 
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1 vc 01 


V 


, / 
V 


311 


R,S 




rilii OQ 

OIU Zo 


V 




6/7 


K 




Arg 31 


V 




2/7 


K.E 




ber lol 


V 


V 


5/7 


T.A 




Arg 432 


V 


V 


3/7 


K.Y.H 




lyr 455 


V 


V 


6/7 


H 




C-TE 


RMINAL POCKET - residues contributins to int 


erior 


Phe 263 


V 


V 


All 






Asn 264 


V 




All 






Ala 266 


V 


V 


All 




F 


Gly 267 


V 


V 


2/7 


A.S.T. 




Val 275 


V 


V 


1/7 


A.Y 




Leu 276 


V 


V 


5/7 


F.W 


F.W 


Lys 277 


V 




1/7 


G,N 




Met 278 


V 


V 


1/7 


L.F 




Val 318 


V 




1/7 


L.I.G 




Ala 320 


V 




in 


V 




Pro 324 


V 


V 


6/7 


Q 




Leu 326 


V 


V 


6/7 


V 




Phe 335 


V 


V 


1/7 


L.V.E 




Pro 337 


V 


V 


5/7 


A,F 




Val 339 


V 




2/7 


L.M 




Met 360 


V 


V 


2/7 


L.V 




Thr 362 


V 


V 


5/7 


L 




Val 368 


V 


V 


2/7 


I.L 




Leu 375 


V 


V 


3/7 


I.V 




Val 376 


V 


V 


2/7 


I.T 




Gly 377 


V 




All 






Leu 379 


V 


V 


All 






Leu 381 


V 


V 


3/7 


P 
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Val 409 


V 


V 


Ml 


L,M.I 




Val 413 


V 


V 


1/7 


F.L 


F 


Val 417 


V 


V 


3/7 


LF 


W 


Lys 420 


V 


V 


5/7 


E 


Y.H 


Leu 421 


V 


V 


5/7 


I.F 




Phe 425 


V 


V 


6/7 


L 




ENTRY ONL 


Y 


Asp 200 


V 




All 






Ser 201 


V 


V 


4/7 


K.T.N 




Val 202 


V 




4/7 


F.I 




Tyr 270 


V 


V 


All 






Arg 416 


V 


V 


1/7 


K.V.D 




Lys 423 


V 




3/7 


R,E.Q 





' Residue name and number. 

15 2 Checked residues show a change in solvent accessible surface area with lipid 
binding. 

^ Checked residues have some atom within 4 A of a lipid atom (if the contact is 
to the head group of the lipid, the residues are listed at the end, under ENTRY 
ONLY). 

20 Conservation in 3 BPI and 4 LBP sequences, e.g.. for He 68, in 3 of the 7 

sequences, the residue is similarly He; for the other 4 sequences, the residue is 
Leu (see note 5). 

* Indicates alternative residues which occur in BPI or LBP at that position for the 
7 BPI and LBP sequences analyzed. 

25 ^ Indicates residues for mutations to block the pockets using residues selected to 
be well-conserved (especially in the N-terminal domain) and relatively small. 
The suggested mutations are all to large sidechains in order to decrease the size 
of the pocket by as much as possible. 
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ATOM 1 

ATOM 2 

5 

ATOM 3 

ATOM 4 

10 ATOM 5 

ATOM 8 

ATOM 10 

15 

ATOM 11 

ATOM 13 

20 ATOM 14 

ATOM 15 

ATOM 1 6 

25 

ATOM 17 

ATOM 2 0 

30 ATOM 2 1 

ATOM 22 

ATOM 23 

35 

ATOM 24 

ATOM 2 5 

40 ATOM 2 6 

ATOM 27 

ATOM 2 8 

45 

ATOM 2 9 

ATOM 31 

50 ATOM 32 

ATOM 33 

ATOM 34 

55 

ATOM 36 

ATOM 37 

60 ATOM 3 8 

ATOM 3 9 

ATOM 40 

65 

ATOM 41 

ATOM 4 2 



CB 


VAL 


A 


1 


CGI 


VAL 


A 


1 


CG2 


VAL 


A 


1 


C 


VAL 


A 


1 


O 


VAL 


A 


1 


N 


VAL 


A 


1 


CA 


VAL 


A 


1 


N 


ASN 


A 


2 


CA 


ASN 


A 


2 


CB 


ASN 


A 


2 


CG 


ASN 


A 


2 


ODl 


ASN 


A 


2 


ND2 


ASN 


A 


2 


C 


ASN 


A 


2 


O 


ASN 


A 


2 


N 


PRO 


A 


3 


CD 


PRO 


A 


3 


CA 


PRO 


A 


3 


CB 


PRO 


A 


3 


CG 


PRO 


A 


3 


C 


PRO 


A 


3 


0 


PRO 


A 


3 


N 


GLY 


A 


4 


CA 


GLY 


A 


4 


C 


GLY 


A 


4 


o 


GLY 


A 


4 


N 


VAL 


A 


5 


CA 


VAL 


A 


5 


CB 


VAL 


A 


5 


CGI 


VAL 


A 


5 


CG2 


VAL 


A 


5 


C 


VAL 


A 


5 


O 


VAL 


A 


5 


N 


VAL 


A 


6 



TABLE 4 






95 . 


. 942 


11 . 


.564 


97. 


.217 


12. 


313 


94 . 


.694 


12 . 


,383 


96 . 


.124 


12. 


.502 


97 . 


.238 


12 . 


.990 


97 . 


,161 


10. 


372 


95 . 


982 


11. 


,216 


95 . 


020 


13 . 


016 


95 . 


067 


14 . 


266 


93 . 


709 


14 . 


566 


93 , 


494 


13 . 


831 


94 . 


323 


13 . 


020 


92 . 


380 


14 . 


,108 


95. 


.439 


15 . 


.376 


94 , 


946 


15. 


.399 


96, 


,373 


16 . 


.260 


97 . 


, 144 


16 , 


.244 


96. 


.806 


17 . 


.367 


98. 


, 083 


17, 


.830 


97. 


.765 


17. 


.619 


95 . 


.765 


18 , 


.482 


95 . 


.104 


18 


.758 


95. 


.615 


19 


.108 


94 , 


.653 


20 


.182 


95 


. 178 


21 


.508 


94 


.407 


22 


.425 


96 


.4 94 


21 


.621 


97 


.134 


22 


.842 


97 


.671 


23 


.681 


98 


.573 


24 


.785 


96 


.526 


24 


.283 


98 


.308 


22 


.436 


99 


.014 


21 


.486 


98 


.492 


23 


.139 



18 . 


.390 


1 . 


00 


51. 


23 


18 . 


, 805 


1. 


00 


48. 


90 


18 . 


,720 


1. 


00 


50. 


67 


16 . 


.083 


1 . 


00 


47 . 


46 


15 . 


.911 


1 . 


00 


49. 


54 


16 . 


.554 


1. 


00 


51. 


03 


16 , 


, 876 


1. 


00 


48. 


7 9 


15 . 


554 


1 . 


00 


42. 


80 


14 . 


810 


1. 


00 


38. 


18 


14 . 


185 


1. 


00 


39. 


01 


12 . 


. 892 


1. 


00 


40. 


06 


12 . 


.487 


1 . 


00 


42. 


94 


12 , 


.228 


1 . 


00 


39. 


49 


15 . 


.786 


1. 


00 


34 . 


97 


16 , 


.916 


1. 


00 


37. 


36 


15. 


.397 


1. 


00 


30, 


80 


14 , 


.143 


1. 


.00 


28, 


90 


16 


.252 


1, 


,00 


28. 


19 


15 


.571 


1. 


00 


26 . 


.30 


14 


.128 


1. 


,00 


26 . 


.17 


16 


.283 


1. 


,00 


28. 


.00 


15 


.275 


1, 


.00 


28 


.89 


17 


.444 


1. 


.00 


26. 


.31 


17 


.588 


1 


.00 


25 


.44 


17 


.091 


1. 


.00 


25 


.34 


16 


.831 


1 


.00 


28 


,37 


16 


.981 


1 


.00 


23 


.98 


16 


.519 


1 


. 00 


21 


.80 


17 


.669 


1 


.00 


18 


.63 


17 


.171 


1 


. 00 


20 


.04 


18 


.468 


1 


.00 


20 


.89 


15 


.660 


1 


.00 


24 


.04 


15 


.984 


1 


.00 


27 


.65 


14 


.551 


1 


.00 


25 


.94 
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ATOM 


44 


CA 


VAL A 


6 


ATOM 


45 


CB 


VAL A 


6 


ATOM 


46 


CGI 


VAL A 


6 


ATOM 


47 


CG2 


VAL A 


6 


ATOM 


48 


C 


VAL A 


6 


ATOM 


49 


0 


VAL A 


6 


ATOM 


50 


N 


VAL A 


7 


ATOM 


52 


CA 


VAL A 


7 


ATOM 


53 


CB 


VAL A 


7 


ATOM 


54 


CGI 


VAL A 


7 


ATOM 


55 


CG2 


VAL A 


7 


ATOM 


56 


C 


VAL A 


7 


ATOM 


57 


0 


VAL A 


7 


ATOM 


58 


N 


ARC A 


8 


ATOM 


60 


CA 


ARG A 


8 


ATOM 


61 


CB 


ARG A 


8 


ATOM 


62 


CG 


ARG A 


8 


ATOM 


63 


CD 


ARG A 


8 


ATOM 


64 


NE 


ARG A 


8 


ATOM 


66 


CZ 


ARG A 


8 


ATOM 


67 


NHl 


ARG A 


8 


ATOM 


70 


NH2 


ARG A 


8 


ATOM 


73 


C 


ARG A 


8 


ATOM 


74 


0 


ARG A 


8 


ATOM 


75 


N 


ILE A 


9 


ATOM 


77 


CA 


ILE A 


9 


ATOM 


78 


CB 


ILE A 


9 


ATOM 


79 


CG2 


ILE A 


9 


ATOM 


80 


CGI 


ILE A 


9 


ATOM 


81 


CDl 


ILE A 


9 


ATOM 


82 


C 


ILE A 


9 


ATOM 


83 


0 


ILE A 


9 


ATOM 


84 


N 


SER A 


10 



67 



99. 


589 


22 . 


874 


13 . 


633 


1 . 


00 


26 . 


54 


99. 


082 


22 . 


292 


12. 


294 


1 . 


00 


29 . 


26 


100. 


253 


22 . 


038 


11. 


357 


1 . 


00 


31 . 


93 


98 . 


318 


21. 


001 


12 . 


535 


1 . 


00 


30 . 


52 


100. 


289 


24 . 


191 


13 . 


355 


1 , 


00 


24 . 


99 


99. 


638 


25. 


202 


13 . 


135 


1 . 


00 


26 . 


97 


101. 


611 


24. 


183 


13 . 


416 


1 . 


00 


26 . 


91 


102 . 


404 


25. 


371 


13 . 


156 


1 , 


00 


27 . 


18 


103. 


401 


25. 


637 


14 . 


298 


1. 


00 


27 . 


59 


104 . 


343 


26. 


773 


13 . 


925 


1 . 


00 


27. 


21 


102 . 


659 


25. 


967 


15 . 


575 


1 . 


00 


27 . 


05 


103 . 


190 


25. 


103 


11. 


869 


1. 


00 


29. 


61 


103 . 


863 


24. 


084 


11. 


,777 


1. 


00 


32 . 


24 


103 . 


077 


25. 


994 


10. 


.917 


1. 


00 


33 . 


04 


103, 


,801 


25, 


.841 


9. 


.668 


1. 


00 


33 . 


25 


102 . 


,842 


25. 


.631 


8. 


.494 


1. 


00 


35, 


,70 


102, 


.195 


24. 


.244 


8. 


.485 


1, 


00 


41. 


64 


101. 


.309 


24 . 


.004 


7. 


.259 


1. 


.00 


44 . 


.24 


102 


.049 


23 


.578 


6 


.073 


1, 


.00 


48 


.00 


101, 


.565 


23 


.626 


4 


.835 


1, 


.00 


51. 


.86 


100 


.341 


24 


.091 


4 


.611 


1 


.00 


52 


.72 


102 


.293 


23 


.177 


3 


.820 


1 


.00 


53 


.10 


104 


.682 


27 


.052 


9 


.441 


1 


.00 


32 


.06 


104 


.196 


28 


.168 


9 


.288 


1 


.00 


32 


.62 


105 


.986 


26 


.830 


9 


.515 


1 


.00 


31 


.21 


106 


.955 


27 


.888 


9 


.315 


1 


.00 


31 


.97 


108 


.210 


27 


.633 


10 


.167 


1 


.00 


32 


.76 


XU 7 


0 p 
. ^ u ts 




. /OX 






]^ 


. 00 


29 


. 90 


107 


.803 


27 


.454 


11 


.633 


1 


.00 


32 


.76 


108 


.945 


27 


.087 


12 


.554 


1 


.00 


36 


.13 


107 


.309 


27 


.869 


7 


.832 


1 


.00 


34 


.24 


107 


.650 


26 


.845 


7 


.265 


1 


.00 


37 


.01 


107 


.179 


29 


.045 


7 


M95 


1 


.00 


34 


.50 
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68 



ATOM 


86 


CA 


SER A 


10 


107 . 


473 


29 . 




J . I f *k 


1 Art no 

X . 00 33.98 


ATOM 


87 


CB 


SER A 


10 


106 . 


536 


30 . 


200 




± . K)Q 33.06 


ATOM 


88 


OG 


SER A 


10 


106 . 


778 


31 . 


A ft fl 
4 o o 


C ^ "T Q 


1 . 00 36.26 


ATOM 


90 


c 


SER A 


10 


108 . 


922 


29 . 


3 O J 


3 . 3U U 


1.00 32.91 


ATOM 


91 


o 


SER A 


10 


10 9 . 


o / *« 


2 Q 


fi Q C 

o o b 


6.416 


1.00 33.57 


ATOM 


92 


N 


GI*N A 




109 . 


300 


2 Q 


<5 fl 


4 . 225 


1.00 34.16 


ATOM 


94 


CA 


GLN A 




110 . 


O 3 .9 




Q1 9 


"J oil 


X . 00 33. 54 


ATOM 


95 


CB 


GLN A 


11 


110 . 


792 


29 . 


Ad. ^ 


9 9 fl 1 


1 nn 9 Q 9 Q 
X . u u 33.3^ 


ATOM 


96 


CG 


GLN A 


11 


112 

X X A • 






fiT n 
u / u 




± . U U 3 0 . z 3 


ATOM 


97 


CD 


GLN A 


11 


113 . 


164 


2 ft 


(52 n 


2 A4 A 


1 n n 9 Q OA 

X • U U 33. i7*k 


ATOM 


98 


OEl 


GLN A 


11 


112 . 


880 


27 . 


774 


X . / X D 


T n n 9 fl 09 
x.uu 30.7/ 


ATOM 


99 


NE2 


GLN A 


11 


114 . 


310 


29 . 


230 


2 . 624 


T no 9 <^ 2ft 


ATOM 


102 


c 


GLN A 


11 


110 , 


991 






*i * 3 X Q 


1 n n 9A <j n 
x.uu 30. ou 


ATOM 


103 


o 


GLN A 


11 


112 . 


116 


31 . 


559 


4 . 740 


T no 41 in 


ATOM 


104 


N 


LYS A 


12 


110 . 


013 


32 . 


216 


4 . 305 


1.00 35 . 51 


ATOM 


106 


CA 


LYS A 


12 


110 . 


245 


33 . 


573 


4 . 788 


1 00 31 96 


ATOM 


107 


CB 


LYS A 


12 


109 . 


. 005 


34 . 


. 445 


4 . 612 


1 00 32 24 


ATOM 


108 


CG 


LYS A 


12 


109 , 


. 226 


35 . 


. 876 


5 . 065 


1 00 32 71 


ATOM 


109 


CD 


LYS A 


12 


107 , 


. 953 


36 , 


. 685 


5 . 063 


1 00 31 87 


ATOM 


110 


CE 


LYS A 


12 


108 


. 227 


38 . 


. 085 


5 - 573 


1 00 34.67 


ATOM 


111 


NZ 


LYS A 


12 


106 < 


. 998 


38 


. 914 


5 . 627 


1.00 37.88 


ATOM 


115 


c 


LYS A 


12 


110 


.612 


33 


.498 


6 .259 


1.00 30 . 93 


ATOM 


116 


o 


LYS A 


12 


111 


.482 


34 


.227 


6 . 728 


1.00 32.37 


ATOM 


117 


N 


GLY A 


13 


109 


. 961 


32 


. 586 


6 . 973 


1.00 30.47 


ATOM 


119 


CA 


GLY A 


13 


110 


.231 


32 


.407 


8.385 


1.00 27.64 


ATOM 


120 


C 


GLY A 


13 


111 


.593 


31 


.791 


8.610 


1.00 29.49 


ATOM 


121 


o 


GLY A 


13 


112 


.312 


32 


. 184 


9.531 


1.00 30.35 






N 


LEU A 


14 


111 


. 952 


30 


. 817 


7 .779 


1.00 28.87 


ATOM 


124 


CA 


LEU A 


14 


113 


. 248 


30 


. 170 




1 on 2 R 9 


ATOM 


125 


CB 


LEU A 


14 


113 


.361 


28 


.941 


7.011 


1.00 25.60 


ATOM 


126 


CG 


LEU A 


14 


112 


.564 


27 


.698 


7.419 


1.00 25.99 


ATOM 


127 


CDl 


LEU A 


14 


112 


.959 


26 


.547 


6.531 


1.00 28.34 


ATOM 


128 


CD2 


LEU A 


14 


112 


.829 


27 


.330 


8.855 


1.00 23.90 


ATOM 


129 


C 


LEU A 


14 


114 


.359 


31 


.155 


7.613 


1.00 28.63 
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ATOM 


130 


0 


LEU A 


ATOM 


131 


N 


ASP A 


ATOM 


133 


CA 


ASP A 


ATOM 


134 


CB 


ASP A 


ATOM 


135 


CG 


ASP A 


ATOM 


136 


ODl 


ASP A 


ATOM 


137 


0D2 


ASP A 


ATOM 


138 


C 


ASP A 


ATOM 


139 


O 


ASP A 


ATOM 


140 


N 


TYR A 


ATOM 


142 


CA 


TYR A 


ATOM 


143 


CB 


TYR A 


ATOM 


144 


CG 


TYR A 


ATOM 


145 


CDl 


TYR A 


ATOM 


146 


CEl 


TYR A 


ATOM 


147 


CD2 


TYR A 


ATOM 


148 


CE2 


TYR A 


ATOM 


149 


C2 


TYR A 


ATOM 


150 


OH 


TYR A 


ATOM 


152 


C 


TYR A 


ATOM 


153 


O 


TYR A 


ATOM 


154 


N 


ALA A 


ATOM 


156 


CA 


ALA A 


ATOM 


157 


CB 


ALA A 


ATOM 


158 


C 


ALA A 


ATOM 


159 


0 


ALA A 


ATOM 


160 


N 


SER A 


ATOM 


162 


CA 


SER A 


ATOM 


163 


CB 


SER A 


ATOM 


164 


OG 


SER A 


ATOM 


166 


C 


SER A 


ATOM 


167 


o 


SER A 


ATOM 


168 


N 


GLN A 









14 


115.346 


31.207 


15 


114.176 


31.972 


15 


115.176 


32.975 


15 


114.695 


33,832 


15 


114.818 


33 .127 


15 


115.662 


32 .209 


15 


114.071 


33 .530 


15 


115 .473 


33 .882 


15 


116.626 


34 .246 


16 


114 .433 


34 .244 


16 


114 .598 


35.098 


16 


113 .239 


35.554 


16 


113 .320 


36.661 


16 


113 .859 


37.905 


16 


113 . 921 


38.936 


16 


112 . 846 


36.474 


16 


112.903 


37.498 


16 


113 .440 


38.724 


16 


113 .491 


39.733 


16 


115 .348 


34 .340 


16 


116 .240 


34.883 


17 


114.982 


33 .081 


17 


115 .632 


32 .250 


17 


115.025 


30 .870 


17 


117.118 


32 .180 


17 


117.957 


32.330 


18 


117 .447 


32.008 


18 


118 . 846 


31.927 


18 


118 .993 


31.399 


18 


118 .220 


32.127 


18 


119.605 


33.238 


18 


120 .768 


33.228 


19 


118 . 959 


34.367 
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8.335 1.00 31.86 

6.582 1.00 31.30 

6.215 1.00 33.98 

5.033 1.00 36.03 

3.681 1.00 36.52 

3.515 1.00 37.01 

2.765 1.00 36.25 

7.413 1.00 35.08 
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1. 


00 


37. 


07 


8.162 


1. 


00 


34. 


55 


9.337 


1. 


00 


34. 


48 


9.865 


1. 


00 


34. 


89 


10 .891 


1. 


00 


35. 


28 


10.562 


1. 


00 


36. 


36 


11 .496 


1. 


00 


34. 


26 


12 .182 


1. 


00 


33 . 


51 


13 .118 


1. 


00 


34. 


97 


12 .767 


1. 


00 


34 . 


08 


13 .697 


1. 


00 


37. 


78 


10 .427 


1. 


00 


35. 


.46 


11.074 


1, 


,00 


36 


.90 


10 .625 


1. 


.00 


36 


.20 


11.617 


1 


.00 


35 


.77 


11.615 


1 


.00 


33 


.78 


11.293 


1 


.00 


37 


.85 


12.178 


1 


.00 


40 


.31 


10.017 


1 


.00 


39 


.36 


9.612 


1 


.00 


42 


.40 


8.175 


1 


.00 


43 


.78 


7.236 


1 


.00 


48 


.26 


9.786 


1 


.00 


42 


.53 


10.178 


1 


.00 


44 


.52 


9.509 


1 


.00 


42 


.42 
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70 



ATOM 


170 


CA 


GLN A 


19 


119. 


630 


35 . 


652 


9.665 


1 . 


00 


42 . 


67 


ATOM 


171 


CB 


GliN A 


19 


118 . 


805 


36 . 


806 


9.089 


1 . 


00 


47. 


13 


ATOM 


172 


CG 


GLN A 


19 


118 . 


810 


36. 


864 


7.563 


1. 


00 


57. 


52 


ATOM 


173 


CD 


GLN A 


19 


118 . 


457 


38. 


238 


7 .012 


1 . 


00 


62. 


74 


ATOM 


174 


OEl 


GLN A 


19 


117. 


488 


38 . 


866 


7.440 


1 . 


00 


65 . 


88 


ATOM 


175 


NE2 


GLN A 


19 


119. 


248 


38 . 


710 


6.051 


1 . 


00 


64 . 


58 


ATOM 


178 


C 


GLN A 


19 


119. 


943 


35 . 


906 


11.126 


1. 


00 


40 . 


97 


ATOM 


179 


O 


GLN A 


19 


121. 


030 


36. 


378 


11.456 


1 . 


00 


41. 


85 


ATOM 


180 


N 


GLN A 


20 


119. 


008 


35. 


565 


12 .005 


1 . 


00 


39. 


51 


ATOM 


182 


CA 


GLN A 


20 


119. 


226 


35 . 


759 


13 .430 


1 . 


00 


37. 


61 


ATOM 


183 


CB 


GLN A 


20 


117. 


944 


35. 


507 


14 .228 


1 . 


00 


38. 


26 


ATOM 


184 


CG 


GLN A 


20 


116 . 


764 


36 . 


405 


13 .874 


1 . 


00 


38. 


33 


ATOM 


185 


CD 


GLN A 


20 


117, 


057 


37 . 


895 


14.002 


1 . 


00 


41. 


17 


ATOM 


186 


OEl 


GLN A 


20 


116 . 


912 


38. 


642 


13 .040 


1 . 


00 


43 . 


42 


ATOM 


187 


NE2 


GLN A 


20 


117. 


.441 


38 . 


335 


15.192 


1, 


00 


42. 


94 
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215 


25. 


918 


35 . 


227 


1 . 


00 


29. 


12 




ATOM 


4461 


0H2 


HOH 


A 


922 


79. 


054 


35. 


414 


40 . 


975 


1, 


00 


41 . 


95 




ATOM 


4464 


0H2 


HOH 


A 


923 


65 . 


692 


27. 


690 


51 . 


372 


1. 


00 


65. 


22 


10 


ATOM 


4467 


0H2 


HOH 


A 


924 


88. 


914 


24 . 


630 


4 . 


689 


1 . 


00 


48 . 


96 




ATOM 


4470 


0H2 


HOH 


A 


925 


100 . 


082 


22 . 


695 


34 . 


405 


1 . 


00 


48. 


58 


15 


ATOM 


4473 


0H2 


HOH 


A 


926 


76 . 


929 


19. 


455 


31 . 


644 


1 . 


00 


86 . 


93 


ATOM 


4476 


0H2 


HOH 


A 


927 


171. 


524 


31. 


723 


13 . 


880 


1 . 


00 


59 . 


94 




ATOM 


4479 


0H2 


HOH 


A 


928 


108 . 


006 


35. 


100 


24 . 


514 


1. 


00 


49 . 


31 


20 


ATOM 


4482 


0H2 


HOH 


A 


929 


141. 


049 


21. 


64 9 


5 . 


587 


1. 


00 


42 . 


12 




ATOM 


4485 


0H2 


HOH 


A 


930 


110 . 


883 


40. 


970 


11 . 


363 


1. 


00 


35 . 


61 


25 


ATOM 


4488 


0H2 


HOH 


A 


931 


74 . 


360 


34 . 


525 


47 . 


158 


1. 


00 


75 . 


14 


ATOM 


4491 


0H2 


HOH 


A 


932 


68. 


751 


40 . 


.894 


44 . 


809 


1, 


00 


43 . 


22 




ATOM 


4494 


0H2 


HOH 


A 


933 


59. 


.758 


29. 


.313 


23 . 


.460 


1 . 


. 00 


61. 


.69 


30 


ATOM 


4497 


OH2 


HOH 


A 


934 


95. 


.173 


16 , 


•341_ 


20, 


.293 


1. 


. 00 


48. 


.05 




ATOM 


4500 


OH2 


HOH 


A 


935 


111. 


.623 


41 


.720 


8 


.640 


1. 


.00 


53 


.02 


35 


ATOM 


4503 


0H2 


HOH 


A 


936 


105 


.604 


16 


.201 


11 


.554 


1 


. 00 


71 


.57 


ATOM 


4506 


OH2 


HOH 


A 


937 


97 


.160 


30 


.542 


3 


.465 


1 


.00 


58 


.62 




ATOM 


4509 


OH2 


HOH 


A 


938 


108 


.492 


10 


.703 


6 


.225 


1 


.00 


76 


.16 


40 


ATOM 


4512 


0H2 


HOH 


A 


939 


135 


.408 


44 


.678 


13 


.765 


1 


.00 


61 


.58 




ATOM 


4515 


0H2 


HOH 


A 


940 


91 


.469 


31 


.723 


41 


.685 


1 


.00 


41 


.59 


45 


ATOM 
ATOM 


4518 
4521 


OH2 
OH2 


HOH 
HOH 


A 
A 


941 
942 


164 
157 


.580 
.888 


32 
29 


.799 
.985 


18 
20 


.975 
.509 


1 
1 


. 00 
.00 


48 

63 


.97 
.61 




ATOM 


4524 


0H2 


HOH 


A 


943 


121 


.776 


21 


.112 


25 


.392 


1 


.00 


61 


.93 


50 


ATOM 


4527 


0H2 


HOH 


A 


944 


96 


.503 


38 


,299 


34 


.009 


1 


. 00 


64 


.71 




ATOM 


4530 


0H2 


HOH 


A 


945 


96 


.403 


17 


.036 


28 


.115 


1 


.00 


76 


.51 


55 


ATOM 
ATOM 


4533 
4536 


0H2 
0H2 


HOH 
HOH 


A 
A 


946 
947 


88 
81 


.114 
.217 


18 
23 


.204 
.098 


31 
12 


.407 
.608 


1 
1 


.00 
.00 


52 
35 


.33 
.93 




ATOM 


4539 


0H2 


HOH 


A 


948 


126 


.713 


24 


.917 


0 


.514 


1 


,00 


82 


.50 


60 


TER 
END 






































Atom 
Type 


Residue # 


X 




Y 








occ 


B 



nip 



.SB. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



wo 98/58961 



REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



REFINEMENT. 
PROGRAM 
AUTHORS 



176 



X-PLOR (online) 
BRUNGER 



PCT/US98/U007 



3 . 843 



DATA USED IN REFINEMENT 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA(F) 
DATA CUTOFF HIGH (ABS (F) 

DATA CUTOFF LOW (ABS(F)) 
COMPLETENESS ( WORKING+TEST ) (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT, 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 
ESTIMATED ERROR OF FREE R VALUE 

FIT IN THE HIGHEST RESOLUTION BIN 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE HIGH (A) 
BIN RESOLUTION RANGE LOW (A) 
BIN COMPLETENESS ( WORKING+TEST) (%) 
REFLECTIONS IN BIN (WORKING SET) 

BIN R VALUE (WORKING SET) 

BIN FREE R VALUE 

BIN FREE R VALUE TEST SET SIZE (%) 
BIN FREE R VALUE TEST SET COUNT 
ESTIMATED ERROR OF BIN FREE R VALUE 



2 .40 
50.00 
0.0 

100000 . 00 
0 .010000 
92 .7 
18908 



THROUGHOUT 
RANDOM 
0.225 
0.295 
10 .2 

1926 
0.007 



6 

2 .40 
2.55 
94.1 

2875 
0 .368 
0.455 

8.7 
275 
0 .027 



NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT, 



PROTEIN ATOMS 
NUCLEIC ACID ATOMS 
HETEROGEN ATOMS 
SOLVENT ATOMS 



3531 
0 

102 
46 



B VALUES. . 
FROM WILSON PLOT (A*^ 
MEAN B VALUE (OVERALL, A*^ 

OVERALL ANISOTROPIC B VALUE. 



'2) 
'2) 



45 
45 



Bll 
B22 
B33 
B12 
B13 
B23 



(A* 
(A* 
(A* 
(A* 



»2) 
*2) 
•2) 
•2) 



(A**2) 
(A**2) 



-11 . 35 
6.33 
5.02 
0 .00 
7 .70 
0 .00 



ESTIMATED COORDINATE ERROR. 
ESD FROM LUZZATI PLOT 
ESD FROM SIGMAA 
LOW RESOLUTION CUTOFF 



(A) 
(A) 
(A) 



0.34 
0.47 
5.00 



CROSS -VALIDATED ESTIMATED COORDINATE ERROR. 
ESD FROM C-V LUZZATI PLOT CA) : 0.45 

0 .46 



3 ESD FROM C-V SIGMAA 



(A) 



RMS DEVIATIONS FROM IDEAL VALUES. 
BOND LENGTHS (A) 
BOND ANGLES (DEGREES) 
DIHEDRAL ANGLES (DEGREES) 
IMPROPER ANGLES (DEGREES) 



0.007 
1.4 
26.1 
1.24 
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REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

SEORES 

SEQRES 

SEORES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SSBOND 

CRYSTl 

ORIGXl 

ORIGX2 

0RIGX3 

SCALEl 

SCALE2 

SCALE3 
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10 



15 



20 
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30 



35 



40 



45 



50 
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3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



ISOTROPIC THERMAL MODEL : RESTRAINED 
ISOTROPIC THERMAL FACTOR RESTRAINS 



MAIN-CHAIN BOND 
MAIN -CHAIN ANGLE 
SIDE -CHAIN BOND 
SIDE -CHAIN ANGLE 

NCS MODEL 

NCS RESTRAINTS. 
GROUP 1 POSITIONAL 
GROUP 1 B- FACTOR 



(A**2 ) 
(A**2) 
(A**2) 
(A**2) 



RMS 
1.77 



SIGMA 



00 
81 
53 



50 
00 
00 
50 



NONE 



PARAMETER FILE 1 
PARAMETER FILE 2 
TOPOLOGY FILE 1 
TOPOLOGY FILE 2 



(A" 

parhcsdx.pro 
paramll . wat 
tophcsdx.pro 
tophi 1 . wat 



(A) 
*2) 



RMS SIGMA/WEIGHT 
NULL ; NULL 
NULL ; NULL 



1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
36 
39 

1 
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^"^07 ''^L^'fo^'^.^^'^''^'- SOLVENT MODEL USED 

507 VAL ASN PRO GLY VAL VAL VAL ARG ILE SER GIji LYS GT Y 

r''^ nff. J^. GL? ^ leu G^ 

SER ?HE L^ 1^1 l"^ "^l^ ^™ SER A^ 

PHE ^ MP? Yll TYR SER 

ATS ARG GLU PHE GLN LEU PRO SER 

«o ^^"^ ASN VAL GLY LEU LYS PHE 

SER ILE SER ASN ALA ASN ILE LYS ILE SER GLY LYS TRP 

LYS ALA GLN LYS ARG PHE LEU LYS MET SER GLY ASN ME 

LYS LEU GLY l^l ^^JJ ^ 
LYS LEU GLY SER ASN PRO THR SER GLY LYS PRO THR ILE 

.^f S^i ^^'^ ILE ASN SER ^ HIS 

VAL HIS ILE SER ALA ALA SER VAL GLY TRP LEU ILE GLN 
LEU PHE HIS LYS LYS ILE GLU SER ALA LEU ARG ASN LYS 
MET ASN SER GLN VAL CYS GLU LYS VAL THR ASN SER VAL 
SER SER GLU LEU GLN PRO TYR PHE GLN THR LEU PRO VAL 
MET THR LYS ILE ASP SER VAL ALA GLY ILE ASN TYR GLY 
LEU VAL ALA PRO PRO ALA THR THR ALA GLU THR LEU ASP 
VAL GLN MET LYS GLY GLU PHE TYR SER GLU ALA ALA ALA 
ALA PRO PRO PRO PHE ALA PRO PRO VAL MET GLU PHE PRO 
ALA ALA ALA ASP ARG MET VAL TYR LEU GLY LEU SER ASP 
TYR PHE PHE ASN THR ALA GLY LEU VAL TYR GLN GLU ALA 
GLY VAL LEU LYS MET THR LEU ARG ASP ASP MET ILE PRO 
LYS GLU SER ALA PHE ARG LEU THR THR SER PHE PHE GLY 
THR PHE LEU PRO GLU VAL ALA LYS LYS PHE PRO ASN MET 
LYS ILE GLN ILE HIS VAL SER ALA SER THR PRO PRO HIS 
LEU SER VAL GLN PRO THR GLY LEU THR PHE TYR PRO ALA 
VAL ASP VAL GLN ALA PHE ALA VAL LEU PRO ASN SER ALA 
LEU ALA SER LEU PHE LEU ILE GLY MET HIS THR THR GLY 
SER MET GLU VAL SER ALA GLU SER ASN ARG LEU VAL GLY 
GLU LEU LYS LEU ASP ARG LEU LEU LEU GLU LEU LYS HIS 
SER ASN ILE GLY PRO PHE PRO VAL GLU LEU LEU GLN ASP 
ILE MET ASN TYR ILE VAL PRO ILE LEU VAL LEU PRO ARG 
VAL ASN GLU LYS LEU GLN LYS GLY PHE PRO LEU PRO THR 
PRO ALA ARG VAL GLN LEU TYR ASN VAL VAL LEU GLN PRO 
HIS GLN ASN PHE LEU LEU PHE GLY ALA ASP VAL VAL TYR 
LYS PC PC HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH 
HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH 
HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH 
HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH HOH 
135 CYS A 175 

33.000 85.200 90.00 101.60 90.00 C 2 4 
1.000000 0.000000 0.000000 0.00000 
0.000000 1.000000 0.000000 0.00000 
0.000000 0.000000 1.000000 0.00000 
0.005368 0.000000 0.001106 0.00000 
0.000000 0.030303 0.000000 0.00000 
0.000000 0.000000 0.011962 0.00000 



507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
A 507 
CYS A 
600 



